Phishing Website Detection using Ensemble methods of Machine Learning

The goal of this project is to investigate the efficacy of Machine Learning algorithms using bagging and boosting Ensemble methods and selecting a combination of features that would increase the accuracy in detecting phishing URL's as either 'Legitimate' or 'Phishing'.

The dataset is taken from Kaggle where the legitimate websites are taken from Yahoo and Starting point directory (Whitelists) and the phishing websites are collected from Phishtank data archive (Blacklists) where suspicious websites are submitted and verified - https://www.kaggle.com/akashkr/phishing-website-dataset
The dataset consists of 11055 URLs and 32 features. There are 6157 legitimate and 4898 phishing websites.

Programming Language used:

Python

Libraries used:

Pandas
Numpy
MatPlotLib
Seaborn
scikit-learn

Models implemented:

Logistic Regression
Random Forest Bagging algorithm
XGBoost Classifier
AdABoost Classifier

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
IS_PROJECT.ipynb		IS_PROJECT.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Phishing Website Detection using Ensemble methods of Machine Learning

Programming Language used:

Libraries used:

Models implemented:

About

Releases

Packages

Languages

sonu-k/Information-Security

Folders and files

Latest commit

History

Repository files navigation

Phishing Website Detection using Ensemble methods of Machine Learning

Programming Language used:

Libraries used:

Models implemented:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages