An Apache Storm analyzer that finds the top 5 hashtags in each country, using MongoDB for geolocation calculations and a heavy-hitters algorithm for counting over streams.
- The tweet source is the Twitter API (via Twitter4J), which is used to get the tweet body and the geolocation information.
- Tweets are then parsed to get the hashtags and the country name. I used the MongoDB driver to find the country nearest to the given geolocation. The country data (longitude and latitude) is provided; I got it from Google here.
- The hashtags are then counted using a heavy-hitters algorithm to get the top 5 occurrences in the stream.
- Finally, the results are visualized using D3 and a Flask (Python) server.
You can find many map code examples here.
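
The parsing, country lookup, and counting steps can be illustrated with a small standalone sketch. This is not the project's Storm topology: it is a minimal Python approximation under several assumptions. The nearest-country lookup uses a plain haversine distance over a tiny in-memory table in place of the MongoDB geospatial query, the centroid values in `COUNTRY_CENTROIDS` are rough illustrative numbers, and Misra-Gries is used as one common heavy-hitters algorithm (the exact variant used in this project may differ). All function and class names here are hypothetical.

```python
import math
import re
from collections import defaultdict

# Illustrative, approximate country centroids: name -> (latitude, longitude).
# Assumption: the real project stores this data in MongoDB instead.
COUNTRY_CENTROIDS = {
    "Egypt": (26.8, 30.8),
    "France": (46.2, 2.2),
    "Japan": (36.2, 138.3),
}

HASHTAG_RE = re.compile(r"#(\w+)")


def extract_hashtags(text):
    """Return the lowercased hashtags found in a tweet body."""
    return [tag.lower() for tag in HASHTAG_RE.findall(text)]


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def nearest_country(lat, lon, centroids=COUNTRY_CENTROIDS):
    """Stand-in for the MongoDB lookup: pick the closest centroid."""
    return min(centroids, key=lambda name: haversine_km(lat, lon, *centroids[name]))


class MisraGries:
    """Heavy-hitters summary that keeps at most k counters."""

    def __init__(self, k):
        self.k = k
        self.counters = {}

    def add(self, item):
        if item in self.counters:
            self.counters[item] += 1
        elif len(self.counters) < self.k:
            self.counters[item] = 1
        else:
            # No free counter: decrement all and drop those that reach zero.
            for key in list(self.counters):
                self.counters[key] -= 1
                if self.counters[key] == 0:
                    del self.counters[key]

    def top(self, n):
        """Return up to n (item, estimated count) pairs, largest first."""
        return sorted(self.counters.items(), key=lambda kv: (-kv[1], kv[0]))[:n]


def top_hashtags_by_country(tweets, n=5, k=50):
    """tweets is an iterable of (text, latitude, longitude) tuples."""
    sketches = defaultdict(lambda: MisraGries(k))
    for text, lat, lon in tweets:
        country = nearest_country(lat, lon)
        for tag in extract_hashtags(text):
            sketches[country].add(tag)
    return {country: sketch.top(n) for country, sketch in sketches.items()}
```

Calling `top_hashtags_by_country` with a list of `(text, latitude, longitude)` tuples returns a dictionary that maps each country to its top hashtags. The counts are estimates: Misra-Gries may undercount an item once the `k` counters are full, which is the trade-off for using bounded memory on an unbounded stream.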