MichaelRGrant/HCDS512_A1

HCDS 512 Assignment A1

Michael Grant

License: MIT

The goal of this project was to query the Wikimedia API and plot the mobile, desktop, and total traffic reported as pagecounts and pageviews. The data covers English Wikipedia from 2008/01/01 to 2017/09/30, for the months in which each data source is available.

The data is split into two types, pagecounts and pageviews. Pagecounts list all views of the site, including those made by web crawlers or spiders, which are programs written to visit pages automatically and usually scrape, or extract, the information they need. Pageviews contain only views made by humans.
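As a rough illustration of how the two data types map onto API requests, the sketch below builds request URLs for each. The endpoint layout and the parameter values (`desktop-site`, `all-access`, `user`, `monthly`) follow the public Wikimedia REST API documentation as best understood here; the function names are hypothetical and are not taken from the notebook. The sketch only builds URLs and makes no network calls.

```python
# Assumption: endpoint paths follow the public Wikimedia REST API docs.
BASE = "https://wikimedia.org/api/rest_v1/metrics"


def pagecounts_url(access_site, start, end, project="en.wikipedia.org"):
    """Build a legacy pagecounts URL (views by humans and crawlers alike).

    access_site is assumed to be one of: all-sites, desktop-site, mobile-site.
    start and end are timestamps in YYYYMMDDHH form.
    """
    return (
        f"{BASE}/legacy/pagecounts/aggregate/"
        f"{project}/{access_site}/monthly/{start}/{end}"
    )


def pageviews_url(access, start, end, project="en.wikipedia.org", agent="user"):
    """Build a pageviews URL; agent="user" requests human traffic only.

    access is assumed to be one of: all-access, desktop, mobile-app, mobile-web.
    """
    return (
        f"{BASE}/pageviews/aggregate/"
        f"{project}/{access}/{agent}/monthly/{start}/{end}"
    )


if __name__ == "__main__":
    # The date range mirrors the span stated in this README.
    print(pagecounts_url("desktop-site", "2008010100", "2017093000"))
    print(pageviews_url("desktop", "2008010100", "2017093000"))
```

Each URL can then be fetched with any HTTP client and the JSON response parsed into monthly totals.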

To reproduce the plot, make sure that all required packages are installed (they are listed in the notebook), then run the notebook as is. Enjoy!


The final data has the following schema. A description of each variable follows.

  • year: the four digit year the data was collected
  • month: the two digit month the data was collected
  • pagecount_all_views: the total views for both humans and spiders/crawlers alike
  • pagecount_desktop_views: the desktop views for both humans and spiders/crawlers alike
  • pagecount_mobile_views: the mobile views for both humans and spiders/crawlers alike
  • pageview_all_views: the total views for only humans
  • pageview_desktop_views: the desktop views for only humans
  • pageview_mobile_views: the mobile views for only humans
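The schema above can be assembled from per-column monthly totals roughly as follows. This is a minimal sketch and not the notebook's code: the helper names `build_rows` and `to_csv` are hypothetical, the input is assumed to be a mapping from column name to `{"YYYYMM": views}`, and filling months with no data with 0 is an assumption.

```python
import csv
import io

# Column order taken from the schema listed above.
COLUMNS = [
    "year",
    "month",
    "pagecount_all_views",
    "pagecount_desktop_views",
    "pagecount_mobile_views",
    "pageview_all_views",
    "pageview_desktop_views",
    "pageview_mobile_views",
]


def build_rows(series):
    """Merge per-column monthly totals into one row per month.

    series maps a column name to {"YYYYMM": views}. Months missing from a
    column are filled with 0 (an assumption, not confirmed by the source).
    """
    months = sorted({ym for values in series.values() for ym in values})
    rows = []
    for ym in months:
        row = {"year": ym[:4], "month": ym[4:6]}
        for column in COLUMNS[2:]:
            row[column] = series.get(column, {}).get(ym, 0)
        rows.append(row)
    return rows


def to_csv(rows):
    """Serialize rows to CSV text with the schema's column order."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    # Made-up numbers, for illustration only.
    sample = {
        "pagecount_all_views": {"200801": 10},
        "pageview_all_views": {"201507": 5},
    }
    print(to_csv(build_rows(sample)))
```

Keeping the year and month as zero-padded strings preserves the four-digit and two-digit formats described in the schema.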

Source:

Wikimedia Foundation

Wikimedia API Docs for PageViews

Wikimedia API Docs for PageCounts

