This project is an assigment for Social Media Analysis course at Wroclaw Univ. of Science and Technology.
Prepration and analysis network of Twitter users participating in the discussion surrounding the controversy.
- understanding Twitter data
- Twitter data acquisition
- Jupyter ipython notebooks
- Python:
- data organization: pandas
- graphs: networkx
- machine learning: scikit-learn
- NLP: polyglot
- scrapping: twint
- Gephi
- Scrapping tweets from 19.10.2020 - 23.10.2020 with hashtag
#trybunalkonstytucyjny
which is related with controversional polish Constitutional Court decision concerning abortion. - Scraping all tweets of found users from 12.10.2020 - 23.10.2020.
- Users graph building - users as nodes and edges between users if one user retweeted other user's tweet at least once (26 335 nodes and 70 403 edges in graph).
- Detecting communities and computing graph layout with Gephi.
- Analysis users activities in most meaningfull communitites.
- Performing users classification task on prepared data.
Performed analysis of gained communities show that two biggest groups suggest good split for Constitutional Court decision supporters and opponents:
- red coloured community - named as Group A in analysis - decision supporters,
- green coloured community - named as Group B in analysis - decision opponents.
HTML version of notebook with full analysis is available here.