Introduction to Data Science

Lectures | Summary | About | Credits

Lectures

  1. R and the tidyverse [.html | .pdf | .Rmd]

  2. What is data science? [.html | .pdf | .Rmd]

  3. Version control and project management [.html | .pdf | .Rmd]

  4. Data science ethics [.html | .pdf | .Rmd]

  5. Functions and debugging [.html | .pdf | .Rmd]

  6. Databases [.html | .pdf | .Rmd]

  7. Web data and technologies [.html | .pdf | .Rmd]

  8. Web scraping and APIs [.html | .pdf | .Rmd]

  9. I2DS Tools for Data Science Workshop [website | materials]

  10. Modeling [.html | .pdf | .Rmd]

  11. Visualization [.html | .pdf | .Rmd]

  12. Automation, scheduling, and packages [.html | .pdf | .Rmd]

  13. Monitoring and communication [.html | .pdf | .Rmd]

  14. [BONUS] Working at the command line [.html | .pdf | .Rmd]

Summary

This is a course taught by Simon Munzert at the Hertie School, Berlin.

Course contents

This course introduces you to the modern data science workflow with R. In recent years, data analysis skills have become essential for careers in policy advocacy and evaluation, business consulting and management, and academic research in education, health, and the social sciences. We will cover version control (Git) and project management; data collection, wrangling, storage, and visualization; model fitting and simulation; advanced workflow issues such as debugging and automation; and data science ethics. The course is intended for students with some experience working with R.

Main learning objectives

The course aims to (1) equip you with conceptual knowledge of the data science pipeline and coding workflow, data structures, and data wrangling; (2) enable you to apply this knowledge with statistical software; and (3) prepare you for our other methods electives and the master’s thesis.

Credits

Many of the materials build on Grant McDermott's excellent course Data Science for Economists.