Credit-card-fraud-detection-using-machine-learning

In this project, i analysed customer-level data that has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group.

The data set is taken from the Kaggle website and has a total of 2,84,807 transactions; out of these, 492 are fraudulent. Since the data set is highly imbalanced, it needs to be handled before model building.

Project pipeline

The project pipeline can be briefly summarised in the following four steps:

---> Data Understanding: Here, you need to load the data and understand the features present in it. This would help you choose the features that you will need for your final model. ---> Exploratory data analytics (EDA): Normally, in this step, you need to perform univariate and bivariate analyses of the data, followed by feature transformations, if necessary. For the current data set, because Gaussian variables are used, you do not need to perform Z-scaling. However, you can check whether there is any skewness in the data and try to mitigate it, as it might cause problems during the model building phase. --> Train/Test split: Now, you are familiar with the train/test split that you can perform to check the performance of your models with unseen data. Here, for validation, you can use the k-fold cross-validation method. You need to choose an appropriate k value so that the minority class is correctly represented in the test folds. --> Model building / hyperparameter tuning: This is the final step at which you can try different models and fine-tune their hyperparameters until you get the desired level of performance on the given data set. You should try and check if you get a better model by various sampling techniques. ---> Model evaluation: Evaluate the models using appropriate evaluation metrics. Note that since the data is imbalanced, it is is more important to identify the fraudulent transactions accurately than the non-fraudulent ones. Choose an appropriate evaluation metric that reflects this business goal.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
4_Credit_card_fraud_detection.ipynb		4_Credit_card_fraud_detection.ipynb
K_Nearest_Neighbor_(KNN)_on__breast_cancer_dataset.ipynb		K_Nearest_Neighbor_(KNN)_on__breast_cancer_dataset.ipynb
README.md		README.md
XGBoost_implementation_on_breast_cancer_dataset.ipynb		XGBoost_implementation_on_breast_cancer_dataset.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Credit-card-fraud-detection-using-machine-learning

Project pipeline

About

Releases

Packages

Languages

josenikhid97/Credit-card-fraud-detection-using-machine-learning

Folders and files

Latest commit

History

Repository files navigation

Credit-card-fraud-detection-using-machine-learning

Project pipeline

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages