Prediction of customer churn on bank data

Abstract

In this project we would be predicting customer churn from bank data. We would be adopting 8 different classification models and apply stratifiedKfold cross validation to check the performance of the models. Stratified K Fold is used because it is best for classification problems. The main challenge would be the imbalanced data which would be handled using SMOTE before training the model. After cross validation of the models, three top performing models are taken and finally hyperparameter tuning is done using grid search to identify ideal hyperparameters for best performance

Files

churn modeling.csv

The dataset used for this project. Data Source: Kaggle.
main.ipynb

Implementation of the project - Jupyter notebook file.
Research.pdf

Detailed information about all preprocessing, implementation and the research work is available in this document.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Prediction of customer churn on bank data

Abstract

Files

Files

README.md

Latest commit

History

README.md

File metadata and controls

Prediction of customer churn on bank data

Abstract

Files