phishing_domain_detector

Project Description 📄

❄️ Predict whether a domain is legitimate or malicious (phishing).

Deployed Website: https://phishing-detector-rqld.onrender.com/

Jupyter Notebook: https://github.com/shiv0112/phishing_domain_detector/blob/main/research/model.ipynb

Data:

Phishing Websites Dataset

https://data.mendeley.com/datasets/72ptz43s9v/1

The data consist of a collection of legitimate and phishing website instances. Each website is represented by a set of features, together with a label that denotes whether the website is legitimate (0) or phishing (1). The data can serve as input to a machine learning process.

The dataset is provided in two variants.

Full variant - dataset_full.csv
Short description of the full variant dataset:
Total number of instances: 88,647
Number of legitimate website instances (labeled as 0): 58,000
Number of phishing website instances (labeled as 1): 30,647
Total number of features: 111

Small variant - dataset_small.csv
Short description of the small variant dataset:
Total number of instances: 58,645
Number of legitimate website instances (labeled as 0): 27,998
Number of phishing website instances (labeled as 1): 30,647
Total number of features: 111
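
The counts above imply that the full variant is imbalanced (roughly 35% phishing), while the small variant is close to balanced (roughly 52% phishing). The following is a minimal sketch that reproduces this arithmetic and shows one way to count labels in a CSV file. The label column name `phishing`, the feature name `feature_a`, and the sample rows are assumptions made for illustration; they are not taken from this README.

```python
import csv
import io
from collections import Counter


def class_counts(csv_file, label_column="phishing"):
    """Count rows per label value. `label_column` is an assumed column name."""
    reader = csv.DictReader(csv_file)
    return Counter(row[label_column] for row in reader)


def phishing_share(legitimate, phishing):
    """Fraction of instances labeled 1 (phishing)."""
    return phishing / (legitimate + phishing)


# Shares computed from the instance counts listed above.
full_share = phishing_share(58_000, 30_647)
small_share = phishing_share(27_998, 30_647)
print(f"full: {full_share:.3f}, small: {small_share:.3f}")

# Tiny in-memory stand-in for a dataset file (made-up rows, one toy feature).
sample = io.StringIO("feature_a,phishing\n2,0\n5,1\n3,1\n")
sample_counts = class_counts(sample)
print(sample_counts)
```

To run the same count on a downloaded file, an open file handle can be passed in place of the `io.StringIO` object, for example `class_counts(open("dataset_small.csv", newline=""))`, provided the label column name matches.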

I trained this model using Random Forest:

Selected features

Metrics of best model used:
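
This README does not state which metrics were reported. As a hedged illustration only, the sketch below computes four metrics that are commonly used for a binary classifier (accuracy, precision, recall, and F1) from true and predicted labels, with 1 meaning phishing. The function name `binary_metrics` and the toy labels are hypothetical and are not results from this project.

```python
def binary_metrics(y_true, y_pred):
    """Return accuracy, precision, recall and F1 for 0/1 labels (1 = phishing)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # phishing caught
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # legitimate passed
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # legitimate flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # phishing missed

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Toy labels, made up for illustration.
y_true = [0, 0, 1, 1, 1, 0, 1, 0]
y_pred = [0, 1, 1, 1, 0, 0, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

On a dataset where phishing is the minority class, as in the full variant, precision and recall are usually more informative than accuracy alone.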

Grid Search Cross-validation on Random Forest:
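
A grid search with cross-validation over a Random Forest can be sketched with scikit-learn as shown below. This is not the project's actual configuration: the synthetic data from `make_classification`, the parameter grid, the number of folds, and the F1 scoring choice are all assumptions made to keep the example small and self-contained.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in for the phishing features (assumption, not the real data).
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Hypothetical grid; the values searched in the project are not given here.
param_grid = {"n_estimators": [25, 50], "max_depth": [4, None]}

search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid,
    cv=3,           # 3-fold cross-validation on the training split
    scoring="f1",   # pick the combination with the best mean F1
)
search.fit(X_train, y_train)

best_params = search.best_params_
# score() uses the refitted best estimator and the same F1 scorer.
test_f1 = search.score(X_test, y_test)
print(best_params, round(test_f1, 3))
```

The held-out test split is kept outside the search so that the final score is not influenced by the parameter selection.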

The ROC Curve for Random Forest:
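
A ROC curve is built from predicted phishing probabilities rather than hard 0/1 predictions. The sketch below shows how the curve points and the area under the curve can be computed with scikit-learn. It uses synthetic data and an arbitrary forest size, both assumptions, so the resulting numbers say nothing about this project's model.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score, roc_curve
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the phishing features (assumption, not the real data).
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

model = RandomForestClassifier(n_estimators=50, random_state=0)
model.fit(X_train, y_train)

# Probability of the positive class (label 1) for each test instance.
scores = model.predict_proba(X_test)[:, 1]

# False positive rate and true positive rate at each decision threshold.
fpr, tpr, thresholds = roc_curve(y_test, scores)
auc = roc_auc_score(y_test, scores)
print(f"points on curve: {len(fpr)}, AUC: {auc:.3f}")
```

Plotting `tpr` against `fpr` gives the curve; the diagonal from (0, 0) to (1, 1) corresponds to a classifier that guesses at random.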

Demo Video:

Page of Website:

Data Input from user:

Authors

Rishabh: [email protected]

Shivansh Srivastava: [email protected]

Ashish Diwakar: [email protected]
