tweets-collector

Collect tweets (tweets corpus) using Twitter API.

Collection can be based on hashtags, keywords, geographical location.

install requirements

pip install -r requirements.txt

Getting your API keys from Twitter

Go to https://apps.twitter.com and create an new app
Provide a name and describe for the app, then specify permissions
Then go to keys and access management tab
put these info in credentials.txt and in api_keys.py files.

query_tweets.py Usage

usage: query_tweets.py [-h] -k KEYWORDS_FILE -o OUTFILE -n NUMBER

collect tweets based on keywords

optional arguments:
  -h, --help            show this help message and exit
  -k KEYWORDS_FILE, --keywords-file KEYWORDS_FILE
                        keywords or hashtags file. The file should contain one
                        keyword/hashtag per line
  -o OUTFILE, --outfile OUTFILE
                        the output json file path and prefix.
  -n NUMBER, --number NUMBER
                        the number of tweets that you want to collect

json2text.py Usage

usage: json2text.py [-h] -i JSON_DIR -o OUT_DIR [--exclude-redundant]
                    [--include-id] [-n] [--remove-repeated-letters]
                    [--keep-only-arabic]

extract tweet texts from json

optional arguments:
  -h, --help            show this help message and exit
  -i JSON_DIR, --json-dir JSON_DIR
                        tweets json directory
  -o OUT_DIR, --out-dir OUT_DIR
                        the output directory.
  --exclude-redundant   exclude redundant tweets
  --include-id          include tweet id
  -n, --normalize       normalize text
  --remove-repeated-letters
                        removed repeated letters (+2 consecutive) from text
  --keep-only-arabic    only keep Arabic words

stream_geolocation.py Usage

Get Geo locations from http://boundingbox.klokantech.com/

usage: stream_geolocation.py [-h] -l GEO_LOCATIONS -j JSON -n NUMBER

collect tweets based on geographic location

optional arguments:
  -h, --help            show this help message and exit
  -l GEO_LOCATIONS, --geo-locations GEO_LOCATIONS
                        geo location coordinates from
                        http://boundingbox.klokantech.com copy and past using 
                        csv option
  -j JSON, --json JSON  the the json output file.
  -n NUMBER, --number NUMBER
                        the number of tweets that you want to collect

stream_users.py Usage

Get users id from https://tweeterid.com

usage: stream_users.py [-h] -u USERS -j JSON -n NUMBER

collect tweets based on following twitter users

optional arguments:
  -h, --help            show this help message and exit
  -u USERS, --users USERS
                        twitter user ids file. Get ids from tweeterid.com
  -j JSON, --json JSON  the the json output file.
  -n NUMBER, --number NUMBER
                        the number of tweets that you want to collect

user_tweets_history.py Usage

get the most recent tweets of a user

usage: user_tweets_history.py [-h] -u USER

emoji list

positive/negative emoji list is obtained from https://emojipedia.org/

Sentiment Analysis in Arabic tweets

Please check the article https://mksaad.wordpress.com/2018/12/07/sentiment-analysis-in-arabic-tweets-with-python/

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
png		png
twitter-files		twitter-files
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
api_keys.py		api_keys.py
json2text.py		json2text.py
json2xls.py		json2xls.py
my_listener.py		my_listener.py
negative_emoji.txt		negative_emoji.txt
positive_emoji.txt		positive_emoji.txt
query_tweets.py		query_tweets.py
requirements.txt		requirements.txt
stream_geolocation.py		stream_geolocation.py
stream_users.py		stream_users.py
text2xlsx.py		text2xlsx.py
tweepy_search.py		tweepy_search.py
tweet_cleaner.py		tweet_cleaner.py
twitter_api_search.py		twitter_api_search.py
user_tweets_history.py		user_tweets_history.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tweets-collector

install requirements

Getting your API keys from Twitter

query_tweets.py Usage

json2text.py Usage

stream_geolocation.py Usage

stream_users.py Usage

user_tweets_history.py Usage

emoji list

Sentiment Analysis in Arabic tweets

About

Releases

Packages

Languages

License

motazsaad/tweets-collector

Folders and files

Latest commit

History

Repository files navigation

tweets-collector

install requirements

Getting your API keys from Twitter

query_tweets.py Usage

json2text.py Usage

stream_geolocation.py Usage

stream_users.py Usage

user_tweets_history.py Usage

emoji list

Sentiment Analysis in Arabic tweets

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages