sonu-k/Information-Security

Phishing Website Detection using Ensemble methods of Machine Learning

The goal of this project is to investigate how effectively Machine Learning algorithms based on bagging and boosting ensemble methods, combined with a selected subset of features, can improve accuracy in classifying URLs as either 'Legitimate' or 'Phishing'.

The dataset is taken from Kaggle (https://www.kaggle.com/akashkr/phishing-website-dataset). The legitimate websites come from the Yahoo and Starting Point directories (whitelists), and the phishing websites come from the PhishTank data archive (blacklists), where suspicious websites are submitted and verified.
The dataset consists of 11055 URLs and 32 features. There are 6157 legitimate and 4898 phishing websites.
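
The class counts above imply a mildly imbalanced dataset. As a rough illustration, the sketch below computes the class shares and the accuracy that a trivial "always predict the majority class" baseline would reach, which is a useful floor when reading model accuracies. The function name `class_balance` is hypothetical and not part of the project; only the two counts come from this README.

```python
# Class counts as stated in this README.
LEGITIMATE = 6157
PHISHING = 4898


def class_balance(legitimate: int, phishing: int) -> dict:
    """Return the total, per-class shares, and the majority-class baseline accuracy."""
    total = legitimate + phishing
    return {
        "total": total,
        "legitimate_share": legitimate / total,
        "phishing_share": phishing / total,
        # Accuracy of a classifier that always predicts the larger class.
        "majority_baseline": max(legitimate, phishing) / total,
    }


balance = class_balance(LEGITIMATE, PHISHING)
print(f"total URLs: {balance['total']}")
print(f"legitimate share: {balance['legitimate_share']:.3f}")
print(f"phishing share: {balance['phishing_share']:.3f}")
print(f"majority-class baseline accuracy: {balance['majority_baseline']:.3f}")
```

Any model worth keeping should beat the majority-class baseline by a clear margin.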

Programming Language used:

  • Python

Libraries used:

  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • scikit-learn

Models implemented:

  • Logistic Regression
  • Random Forest (bagging algorithm)
  • XGBoost Classifier
  • AdaBoost Classifier
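
The model list above could be compared along the following lines. This is a minimal sketch under several assumptions, not the project's actual code: it uses synthetic data from scikit-learn's `make_classification` instead of the Kaggle dataset, it substitutes scikit-learn's `GradientBoostingClassifier` for XGBoost so that only scikit-learn is needed, and the function name `evaluate_models`, the feature count, and all hyperparameters are illustrative choices.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    AdaBoostClassifier,
    GradientBoostingClassifier,
    RandomForestClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def evaluate_models(random_state: int = 0) -> dict:
    """Train each model on synthetic data and return its test accuracy."""
    # Synthetic stand-in for the phishing dataset (assumption, not real data).
    X, y = make_classification(
        n_samples=1000, n_features=30, n_informative=10, random_state=random_state
    )
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, stratify=y, random_state=random_state
    )

    models = {
        # Linear baseline.
        "logistic_regression": LogisticRegression(max_iter=1000),
        # Bagging: many trees fit on bootstrap samples, predictions combined by voting.
        "random_forest": RandomForestClassifier(
            n_estimators=50, random_state=random_state
        ),
        # Boosting stand-in for XGBoost (swapped in to avoid an extra dependency).
        "gradient_boosting": GradientBoostingClassifier(
            n_estimators=50, random_state=random_state
        ),
        # Boosting: reweights misclassified samples at each round.
        "adaboost": AdaBoostClassifier(n_estimators=50, random_state=random_state),
    }

    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = accuracy_score(y_test, model.predict(X_test))
    return scores


scores = evaluate_models()
for name, accuracy in scores.items():
    print(f"{name}: {accuracy:.3f}")
```

To run this against the real data, the synthetic `X` and `y` would be replaced by the feature columns and label column loaded from the Kaggle CSV.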
