
Detecting Phishing Websites using Decision Trees: A Machine Learning Approach

Phishing websites pose a significant threat to internet users, so a robust detection strategy is required. This study uses the Phishing Websites Dataset from the UCI Machine Learning Repository, which contains 30 website-related features, together with a decision tree classifier from the scikit-learn package. The dataset is preprocessed to remove invalid and missing values, and the most pertinent features are selected for model training. The model is trained on 80% of the dataset and tested on the remaining 20%. The decision tree classifier achieves an accuracy of 95.97%, with a true positive rate of 96.64% and a false positive rate of 3.04%, as computed from the confusion matrix. In addition to validating the efficacy of decision trees for phishing detection, the study highlights the importance of feature selection and preprocessing for model performance. The method can help businesses and individuals protect themselves from phishing attacks, and the accompanying data visualisations make it easier to understand the dataset and assess the model.
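The workflow summarised in the abstract (cleaning, feature selection, an 80/20 split, a scikit-learn decision tree, and rates read from the confusion matrix) could be sketched roughly as follows. This is a minimal illustration, not the authors' code. It rests on several assumptions: the data is a synthetic stand-in from `make_classification` rather than the UCI dataset, label 1 is treated as "phishing", `SelectKBest` with `f_classif` and `k=15` is an arbitrary choice of selection method, and the helper `rates_from_confusion` is a hypothetical name introduced here. The printed numbers will therefore not match the figures reported in the study.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier


def rates_from_confusion(cm):
    """Return (accuracy, true positive rate, false positive rate).

    Expects a 2x2 matrix in scikit-learn's layout for labels=[0, 1]:
    [[TN, FP], [FN, TP]].
    """
    tn, fp, fn, tp = np.asarray(cm).ravel()
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    tpr = tp / (tp + fn)  # share of positive samples that were caught
    fpr = fp / (fp + tn)  # share of negative samples wrongly flagged
    return accuracy, tpr, fpr


# Assumption: synthetic stand-in with 30 features; label 1 plays the role of "phishing".
X, y = make_classification(
    n_samples=2000, n_features=30, n_informative=10, random_state=0
)

# Preprocessing placeholder: drop rows that contain missing or non-finite values.
valid_rows = np.isfinite(X).all(axis=1)
X, y = X[valid_rows], y[valid_rows]

# 80% of the samples for training, 20% for testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Feature selection is fitted on the training split only, to avoid leaking test data.
# Assumption: the selection method and k=15 are illustrative choices.
selector = SelectKBest(f_classif, k=15).fit(X_train, y_train)
X_train_sel = selector.transform(X_train)
X_test_sel = selector.transform(X_test)

# Train the decision tree and evaluate it on the held-out split.
clf = DecisionTreeClassifier(random_state=0).fit(X_train_sel, y_train)
y_pred = clf.predict(X_test_sel)

cm = confusion_matrix(y_test, y_pred, labels=[0, 1])
accuracy, tpr, fpr = rates_from_confusion(cm)
print(f"accuracy={accuracy:.4f}  TPR={tpr:.4f}  FPR={fpr:.4f}")
```

Fitting the selector on the training split alone is a design choice made for this sketch; the abstract does not say whether feature selection was performed before or after the split.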

Lahore Garrison University

Ashar Ahmed Fazal Maryam Daud

International Journal for Electronic Crime Investigation

2023



