Skip to content

pddiii/What-makes-a-Hall-of-Famer

Repository files navigation

National Baseball Hall of Fame Status Prediction Model using Random Forest and Gradient Boosted Decision Trees.

Contributors

Commits made post 12/16/2023 are done by Peter D. DePaul III

  • Peter D. DePaul III - Data Cleaning, Batter's Models, FanGraphs Batting and Pitching models, and Final Report

  • Nelson Duong - Pitcher's Models & Exploratory Data Analysis

  • Jeffrey Gutierrez - Final Report

  • Yuji Kusuyama - Exploratory Data Analysis

  • Alan Wong - Final Report

Resources

Data Collection

Baseball Reference Model Data

The "Data Cleaning.R" file contains the data cleaning, and feature engineering process for both the Baseball Reference and FanGraphs Model.

FanGraphs Model Data

Data Dictionary

Baseball Reference Model

Batter's Dictionary

Pitcher's Dictionary

FanGraphs Model

Batter's Dictionary

Pitcher's Dictionary

Full Project Report

Baseball Hall of Fame Prediction