Project README

Detecting-Malicious-URL-Using-Pyspark

Development Enviroment

Apache Spark 2.3.0
Jupyter Notebook

Datasets

Datasets used in this project is manually obtained from the following sources:

Phising URLS

Phishtank - https://www.phishtank.com/developer_info.php
Open Phis - https://openphish.com/

SPAM URLS

JWSPAMSPY - http://www.joewein.de/sw/blacklist.htm

Malware URLS

Benign URLS

Majestic - https://majestic.com/reports/majestic-million

Another Usefull Source to collect Malicious URLs

https://zeltser.com/malicious-ip-blocklists/

The Dataset.csv used in this project is the combination of the above sources. A data pre-processing program is used to clean and filter the data. Thus, the dataset is already being labelled and ready to be used in the project.

Open Source Agenda is not affiliated with "Detecting Malicious URL Machine Learning" Project. README Source: rlilojr/Detecting-Malicious-URL-Machine-Learning

Stars

Open Issues

Last Commit

5 years ago

Repository

rlilojr/Detecting-Malicious-URL-Machine-Learning

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/detecting-malicious-url-machine-learning"><img src="https://www.opensourceagenda.com/projects/detecting-malicious-url-machine-learning/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog