Javascript must be enabled to continue!

Machine-Learning Classification Model and Tools for Real-time URL Phishing Detection

Abstract Phishing attacks are considered a significant cybersecurity concern, employing deceptive tactics to entice individuals into engaging with counterfeit websites. These malicious pages are skillfully designed replicas of legitimate platforms, aiming to collect sensitive data like usernames, passwords, banking credentials, and other personal details. This study focuses on phishing via Uniform Resource Locators (URLs) and investigates the potential of machine learning to identify such deceptive websites based on their behavior and URL attributes. To accomplish this, the work introduces and demonstrates two key tools; one for dataset creation and the other for URL classification.Machine learning has already shown its effectiveness in identifying phishing attacks from URLs, though there are still some obstacles to be overcome, such as the need for vast quantities of high-quality training data and the requirement to keep up with the constantly changing tactics employed by phishing attackers. The integration of the proposed tools in a web browser plugin is supposed to enable real-time URL analysis within web browsers, enhancing the system's effectiveness against phishing attacks and hence improving user experience.Using a self-collected dataset of 46,000 URLs, several machine learning algorithms were trained and tested including support vector machine (SVM), XGBoost, decision tree, and random forest algorithms. Among these, XGBoost model achieved an impressive classification accuracy of 96%, F1-Score of 96.7%, Recall of 96.6% and Precision 96.9% after assessing various permutations of hyperparameter values using the grid search procedure. This success underscores the potency of machine learning techniques in bolstering cyber defenses and mitigating the impact of phishing attacks.

Springer Science and Business Media LLC

Ramzi Saifan Hani Ahmad Talal A. Edwan

2025

Title: Machine-Learning Classification Model and Tools for Real-time URL Phishing Detection

Description:

Abstract Phishing attacks are considered a significant cybersecurity concern, employing deceptive tactics to entice individuals into engaging with counterfeit websites.

These malicious pages are skillfully designed replicas of legitimate platforms, aiming to collect sensitive data like usernames, passwords, banking credentials, and other personal details.

This study focuses on phishing via Uniform Resource Locators (URLs) and investigates the potential of machine learning to identify such deceptive websites based on their behavior and URL attributes.

To accomplish this, the work introduces and demonstrates two key tools; one for dataset creation and the other for URL classification.

Machine learning has already shown its effectiveness in identifying phishing attacks from URLs, though there are still some obstacles to be overcome, such as the need for vast quantities of high-quality training data and the requirement to keep up with the constantly changing tactics employed by phishing attackers.

The integration of the proposed tools in a web browser plugin is supposed to enable real-time URL analysis within web browsers, enhancing the system's effectiveness against phishing attacks and hence improving user experience.

Using a self-collected dataset of 46,000 URLs, several machine learning algorithms were trained and tested including support vector machine (SVM), XGBoost, decision tree, and random forest algorithms.

Among these, XGBoost model achieved an impressive classification accuracy of 96%, F1-Score of 96.

7%, Recall of 96.

6% and Precision 96.

9% after assessing various permutations of hyperparameter values using the grid search procedure.

This success underscores the potency of machine learning techniques in bolstering cyber defenses and mitigating the impact of phishing attacks.

Back

Related Results

Phishing Cyber Security Threats

Phishing is a growing threat in the realm of cybersecurity, where cybercriminals use various phishing techniques to steal sensitive information from individuals and organizations. ...

A Robust Model for Phishing URL Classification and Intrusion Detection using Machine Learning Techniques

Phishing is one of the most prevalent and risky online threats. It works when hackers deceive internet users into providing personal information, such as passwords, login credentia...

Deep Learning Based Phishing Websites Detection

Phishing is a crime that involves the theft of confidential user information. Those targeted by phishing websites include individuals, small businesses, cloud storage providers, an...

Persistence and half‐life of URL citations cited in LIS open access journals

PurposeThe main purpose of the present study is to examine the availability and persistence of URL citations in two LIS open access journals. It also intended to calculate the half...

AI-Based Phishing Attack Detection And Prevention Using Natural Language Processing (NLP)

Phishing attacks remain one of the most prevalent and damaging cybersecurity threats, targeting users across various communication channels such as email, social media, and SMS. Tr...

A Study on the Best Classification Method for an Intelligent Phishing Website Detection System

It is impossible to imagine our lives without the internet, but it has also meant that malicious acts such as phishing can be carried out anonymously. Phishers use social engineeri...

A Study on the Best Classification Method for an Intelligent Phishing Website Detection System

It is impossible to imagine our lives without the internet, but it has also meant that malicious acts such as phishing can be carried out anonymously. Phishers use social engineeri...

Identification of Phishing Urls Using Machine Learning

Abstract Phishing is a typical assault on unsuspecting individuals by making them to reveal their one-of-a-kind data utilizing fake sites. The target of phishing sit...

Email:
Password:

Email: