This is a codebase for a small heart disease classification project undertaken for a graduate class EN.625.740.81.FA17 Data Mining.
The data source is from the UCI repository, available here:
My primary interest with this project is to experimentally explore the importance of selecting appropriate features, unlearnt model parameters and other unlearnt stuff.
To take advantage of as many high level tools as possible, I attempted to comply with all relevant sklearn conventions