Skip to content

Word frequency counts of crawled news headlines & sentiment analysis of tweets on THAAD missile crisis.

Notifications You must be signed in to change notification settings

ehsong/thaad-crisis

Repository files navigation

thaad-crisis

web crawling and word frequency counts

Word frequency counts of news headlines on THAAD in South Korean media. The notebook includes the code used to crawl the web portal Naver to query terms and collect news headlines within a time period. The note book is here.

examining shared links

The notebook contains code I used to examine tweets that included terms on China and THAAD. I examined the web sources of the news shared on twitter, and expanded bit.ly links to examine actual link sources. The notebook is here.

sentiment analysis using lexicons

To further understand sentiment towards China across two time frames, I preprocessed about 0.5 million tweet corpus and then conducted sentiment analysis by using the Korean Sentiment Lexicon 한국 감성 사전 which has 6K positive and negative unigram terms. Using the lexicon, I obtained the polarity scores for tweets that appeared before and after the missile crisis. The notebook where I preprocessed the tweets is here, the function for caculating the polarity score is here and the final plot is here.

APSA presentation slides

The goal of this research project was to examine the effects of media reporting on pollution attribution in South Korea on public sentiment towards China, and was presented at Annual Political Science Association Conference on August 2019. For a summary of results and the literature, please see the poster link here.

About

Word frequency counts of crawled news headlines & sentiment analysis of tweets on THAAD missile crisis.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published