Twitter scraper for Python

Mine data from Twitter and perform basic big data and NLP tasks.

This is a simple API that lets you collect tweets and analyze them in three different ways:

  • Get word clouds of the tweets that match your search
  • Analyze the sentiment of tweets
  • Place the location of tweets on a map 💥

What it does

This API allows you to:

  • Get tweets from Twitter for a search term (you can choose the tweets' date range).
  • Generate a word cloud from a dataframe of tweets and save it to a file.
  • Run a sentiment analysis classifier, get a bar graph of the results, and store the results in a dataframe.
  • Save a dataframe of tweets to a MySQL database or a CSV file.
  • Read a dataframe of tweets from a CSV file.
  • Create a map showing the users' location for all the tweets that include such information.

How to use it

If you're getting your tweets directly from Twitter, you will have to get developer access and then provide the required credentials to the code via a text file (see the example below).
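
The exact layout of twitter_credentials.txt is whatever connect_to_twitter expects; the field names below are only a hypothetical sketch, but a Twitter developer account provides these four values:

# twitter_credentials.txt (hypothetical layout)
consumer_key=YOUR_CONSUMER_KEY
consumer_secret=YOUR_CONSUMER_SECRET
access_token=YOUR_ACCESS_TOKEN
access_token_secret=YOUR_ACCESS_TOKEN_SECRET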

If you have your data stored in a database or a CSV file, read it into a dataframe and you're good to go. Be aware that this implementation expects a DataFrame object that includes a text field and a user_location field.
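
For example, a minimal DataFrame with the two expected fields (the column names come from the description above) can be built by hand:

import pandas as pd

# minimal DataFrame with the two fields this code expects
tweets_df = pd.DataFrame({
    "text": ["First example tweet", "Second example tweet"],
    "user_location": ["Bogotá, Colombia", "Toronto, Canada"],
})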

Requirements

To use this code you must have the following installed:

  • numpy
  • pandas
  • matplotlib
  • nltk (sentiment classifier)
  • selenium (map functionality)
  • geocoder (map functionality)
  • geckodriver (map functionality)
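
Assuming the sentiment classifier uses NLTK's built-in VADER analyzer (an assumption; the neg, neu, pos, and compound fields described below match its output), the VADER lexicon must be downloaded once before running the analysis:

import nltk

# one-time download of the lexicon used by nltk's SentimentIntensityAnalyzer (assumed classifier)
nltk.download('vader_lexicon')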

Examples

Connect to Twitter and get tweets matching a search term

api = connect_to_twitter("your_file_location/twitter_credentials.txt")

search_for = "pfizer + vaccine"
search_query = search_for + " -filter:retweets"
start_date = "2020-01-08"
until_date = "2021-01-09"  # no tweets on this date or later will be selected
num_tweets = 100

tweets_df = get_tweets(api, search_query, start_date, until_date, num_tweets)

tweets_df is a Pandas DataFrame containing the results.
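
A quick way to check what came back is to inspect the DataFrame with standard pandas calls:

print(tweets_df.shape)    # number of tweets retrieved and number of columns
print(tweets_df.columns)  # available fields, e.g. text and user_location
print(tweets_df.head())   # first few rows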

Create a word cloud for tweets

You need a Pandas DataFrame containing a text field for the tweet's text.

clean_words = clean_word(tweets_df)  # clean the tweet text for the word cloud

# create the word cloud and save the image
wcloud(clean_words, "your_file_location/filename", "Word Cloud Title")

Run a sentiment classifier on the dataframe of tweets

You need a Pandas DataFrame containing a text field for the tweet's text.

# run the sentiment analysis classifier
new_df = sa_tweets(tweets_df, "file_location")

new_df is an augmented version of tweets_df containing 4 additional fields with the results of the classification: neg, neu, pos, and compound. The function will also save a bar graph of the results.
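
If the classifier is VADER-style (see the assumption above), compound is a single score in [-1, 1], and the commonly used ±0.05 thresholds turn it into a label:

# label each tweet from its compound score (conventional VADER thresholds)
def label_sentiment(compound):
    if compound >= 0.05:
        return "positive"
    if compound <= -0.05:
        return "negative"
    return "neutral"

new_df["sentiment"] = new_df["compound"].apply(label_sentiment)
print(new_df["sentiment"].value_counts())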

Create a map of user locations

You need a Pandas DataFrame containing a user_location field.

# create a map
create_tweets_map(tweets_df, "image_filename")

The resulting map is saved to image_filename. There is no need to add an extension; the program adds it automatically.

Save tweets to a database

# save to database
save_df_to_db(tweets_df, "your_location/database_credentials.txt", "location/filename")
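
Loading the saved tweets back from the database is still on the to-do list below; as a sketch, assuming the table is named tweets and using SQLAlchemy with a MySQL driver (neither is bundled with this project), it could look like:

import pandas as pd
from sqlalchemy import create_engine

# hypothetical connection string and table name; adjust to match what save_df_to_db created
engine = create_engine("mysql+pymysql://user:password@localhost/tweets_db")
tweets_df = pd.read_sql("SELECT * FROM tweets", engine)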

Save tweets to CSV file

# save to CSV file
tweets_df.to_csv("filename.csv", index=True)

Read tweets from a CSV file

import pandas as pd

tweets_df = pd.read_csv("filename.csv")

To do

  • Verify the method that saves the tweets to the database.
  • Write code for loading tweets from database into a dataframe.
  • Improve preprocessing for both word clouds and sentiment analysis.
  • Train my own sentiment classifier.
  • Work on geospatial analysis (partial work done; there is a map function now 😃).
