- Level Difficulty in Candy Crush Saga
- Analyze data from the hit mobile game, Candy Crush Saga.
- Topics related: Data Manipulation, Data Visualization, Importing & Cleaning Data.
- R packages used: dplyr, ggplot2
- Rise and Fall of Programming Languages
- Analyze the relative popularity of programming languages over time based on Stack Overflow data.
- Topics related: Data Manipulation, Data Visualization, Importing & Cleaning Data.
- R packages used: dplyr, ggplot2
- Predict Taxi Fares with Random Forests
- Use regression trees and random forests to find places where New York taxi drivers earn the most.
- Topics related: Supervised Learning in R (Regression), Data Manipulation with dplyr, Introduction to Data Visualization with ggplot2.
- R packages used: tidyverse, ggmap, viridis, tree, lubridate, randomForest
- What Makes a Pokémon Legendary?
- Use tree-based machine learning methods to identify the characteristics of legendary Pokémon.
- Topics related: Supervised Learning in R (Classification), Tree-Based Models in R, Introduction to the Tidyverse
- R packages used: tidyverse, rpart, rpart.plot, randomForest, ROCR
- Analyze International Debt Statistics
- Write SQL queries to answer interesting questions about international debt using data from The World Bank.
- Topics related: Data Manipulation, Importing & Cleaning Data in SQL
- What and Where Are the World's Oldest Businesses?
- Use SQL data manipulation and joins to discover the oldest businesses around the world.
- Topics related: Data Manipulation in SQL