Wiki-Death: a clone of The Pudding repo, plus video tutorial
Clone the repo and run npm i
Download the HTML for the Wikipedia year event pages that list all notable deaths (2015-2018)
Extract every person and their metadata from the event pages to create a CSV of all notable deaths
Use the Wikipedia pageviews API to get the daily pageviews for en.wikipedia.org as a whole, so that we can calculate each person's percent of total traffic and look into seasonality in the data
Use the Wikipedia pageviews API to get the daily pageviews for each person in our full list of notable deaths
Join all three sources into a single CSV for each person that contains the daily pageviews, percent of traffic, and metadata
Calculate summary statistics for each person's pageviews
Reduce the data to people who satisfy some quantitative criteria
Get more detailed info on each person
Consolidate and optimize the data for web presentation
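
The join and summary steps above can be sketched roughly as follows. This is a minimal illustration, not the repo's actual code: the function names (joinPerson, summarize), the field names (date, views, share, dateOfDeath), and the choice of median and peak as summary statistics are all assumptions made for the example.

```javascript
// Hypothetical record shapes (the repo's real column names may differ):
//   siteViews:   [{ date: '2016-01-01', views: 1000 }]  daily totals for en.wikipedia.org
//   personViews: [{ date: '2016-01-01', views: 10 }]    daily views for one person's page
//   meta:        { name: 'Example Person', dateOfDeath: '2016-01-02' }

// Join one person's daily pageviews with the site-wide totals and metadata.
// Days with no site-wide total (or a total of zero) are skipped so that
// the share of traffic is always a finite number.
function joinPerson(meta, personViews, siteViews) {
  const siteByDate = new Map(siteViews.map((d) => [d.date, d.views]));
  return personViews
    .filter((d) => siteByDate.get(d.date) > 0)
    .map((d) => ({
      name: meta.name,
      dateOfDeath: meta.dateOfDeath,
      date: d.date,
      views: d.views,
      share: d.views / siteByDate.get(d.date), // fraction of all en.wikipedia.org traffic
    }));
}

// Summarize one person's joined rows: median daily views plus the peak day.
// Which statistics the original pipeline computes is an assumption here.
function summarize(rows) {
  if (rows.length === 0) return null;
  const sorted = rows.map((r) => r.views).sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  const median =
    sorted.length % 2 === 1 ? sorted[mid] : (sorted[mid - 1] + sorted[mid]) / 2;
  const peak = rows.reduce((best, r) => (r.views > best.views ? r : best));
  return { median, max: peak.views, maxDate: peak.date };
}
```

Keeping the share as a fraction of site-wide traffic, rather than raw views, is what lets the later steps compare people across days with different overall traffic levels.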