
An Optimized Approach For Detection and Classification of Spam Email’s Using Ensemble Methods

Abstract Since the advent of email services, spam has been a major concern because users’ security depends on correctly classifying emails as ham or spam. Spam serves as a vehicle for malware attacks and has been used for spear phishing, whaling, clone phishing, website forgery, and other harmful activities. However, ensemble Machine Learning (ML) algorithms for detecting and filtering spam emails remain comparatively unexplored. In this research, we offer an optimized ML-based algorithm for detecting spam emails, enhanced using hyper-parameter tuning approaches. The proposed approach uses two feature extraction modules, Count-Vectorizer and TFIDF-Vectorizer, which provided the most effective classification results when applied to three different publicly available email data sets: Ling Spam, UCI SMS Spam, and a proposed dataset. Moreover, to improve classifier performance, we used various ML methods, namely Naive Bayes (NB), Logistic Regression (LR), Extra Tree, Stochastic Gradient Descent (SGD), XG-Boost, Support Vector Machine (SVM), Random Forest (RF), and Multi-Layer Perceptron (MLP), together with parameter optimization approaches such as manual search, random search, grid search, and a genetic algorithm. On all three data sets, SGD outperformed the other algorithms. The other models, including the ensembles (Extra Tree, RF), the linear models (LR, Linear-SVC), and MLP, also performed well, with relatively high precision, recall, accuracy, and F1-score.
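The pipeline the abstract describes (count or TF-IDF features, an SGD-trained classifier, and a search over hyper-parameters) can be illustrated with a minimal standard-library sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy messages, the function names, the choice of logistic loss for the SGD model, and the small grid of learning rates and epoch counts. It is not the authors' implementation.

```python
import math
import random
from collections import Counter
from itertools import product

# Toy messages invented for illustration (1 = spam, 0 = ham); not the paper's data.
TRAIN = [
    ("win a free prize now", 1),
    ("free cash offer click now", 1),
    ("claim your free prize today", 1),
    ("cheap loans win cash now", 1),
    ("meeting agenda for monday", 0),
    ("please review the attached report", 0),
    ("lunch on monday with the team", 0),
    ("notes from the project meeting", 0),
]
VALID = [
    ("free prize click now", 1),
    ("win cash today", 1),
    ("project report for the team", 0),
    ("agenda for the monday meeting", 0),
]

def tokenize(text):
    return text.lower().split()

def fit_vocab(docs):
    """Build a term -> index map and smoothed IDF weights from training documents."""
    vocab, doc_freq = {}, Counter()
    for doc in docs:
        for term in set(tokenize(doc)):
            doc_freq[term] += 1
            vocab.setdefault(term, len(vocab))
    n = len(docs)
    idf = {t: math.log((1 + n) / (1 + doc_freq[t])) + 1.0 for t in vocab}
    return vocab, idf

def vectorize(doc, vocab, idf, use_tfidf):
    """Sparse vector {index: weight}: raw counts, or L2-normalised TF-IDF."""
    counts = Counter(t for t in tokenize(doc) if t in vocab)
    vec = {vocab[t]: c * (idf[t] if use_tfidf else 1.0) for t, c in counts.items()}
    if use_tfidf:
        norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
        vec = {i: v / norm for i, v in vec.items()}
    return vec

def train_sgd(X, y, dim, lr, epochs, seed=0):
    """Logistic regression fitted with plain stochastic gradient descent."""
    rng = random.Random(seed)
    w, b = [0.0] * dim, 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            z = b + sum(w[j] * v for j, v in X[i].items())
            grad = 1.0 / (1.0 + math.exp(-z)) - y[i]  # d(log-loss)/dz
            for j, v in X[i].items():
                w[j] -= lr * grad * v
            b -= lr * grad
    return w, b

def predict(x, w, b):
    return 1 if b + sum(w[j] * v for j, v in x.items()) > 0 else 0

def grid_search(train, valid, grid):
    """Try every combination in the grid and record validation accuracy."""
    docs, labels = zip(*train)
    vocab, idf = fit_vocab(docs)
    results = []
    for use_tfidf, lr, epochs in product(grid["use_tfidf"], grid["lr"], grid["epochs"]):
        X = [vectorize(d, vocab, idf, use_tfidf) for d in docs]
        w, b = train_sgd(X, list(labels), len(vocab), lr, epochs)
        hits = sum(predict(vectorize(d, vocab, idf, use_tfidf), w, b) == t
                   for d, t in valid)
        params = {"use_tfidf": use_tfidf, "lr": lr, "epochs": epochs}
        results.append((params, hits / len(valid)))
    return results

# Hypothetical grid; the paper's actual search space is not given in the abstract.
GRID = {"use_tfidf": [False, True], "lr": [0.1, 0.5], "epochs": [5, 50]}
results = grid_search(TRAIN, VALID, GRID)
best = max(results, key=lambda r: r[1])
print("best setting:", best[0], "validation accuracy:", best[1])
```

Random search would sample settings from the same grid instead of enumerating all of them, and a genetic algorithm would evolve a population of settings using validation accuracy as the fitness score. In practice a library implementation of the vectorizers and search routines would replace the hand-written functions above.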

Springer Science and Business Media LLC

Rubab Fatima, Muhammad Sadiq, Saleem Ullah, Gulnaz Ahmed, Saqib Mahmood

2023


