Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

An Optimized Approach For Detection and Classification of Spam Email’s Using Ensemble Methods

View through CrossRef
Abstract Since the advent of email services, spam emails are a major concern because users’ security depends on the classification of emails as ham or spam. It’s a malware attack that has been used for spear phishing, whaling, clone phishing, website forgery, and other harmful activities. However, various ensemble Machine Learning (ML) algorithms used for the detection and filtering of spam emails have been less explored. In this research, we offer a ML based optimized algorithm for detecting spam emails that have been enhanced using Hyper-parameter tuning approaches. The proposed approach uses two feature extraction modules, namely Count-Vectorizer and TFIDF-Vectorizer that provide the most effective classification results when we applied them to three different publicly available email data sets: Ling Spam, UCI SMS Spam, and Proposed dataset. Moreover, to extend the performance of classifiers we used various ML methods such as Naive Bayes (NB), Logistic Regression (LR), Extra Tree, Stochastic Gradient Descent (SGD), XG-Boost, Support Vector Machine (SVM), Random Forest (RF), Multi Layer Perception (MLP), and parameter optimization approaches such as Manual search, Random search, Grid search, and Genetic algorithm. For all three data sets, the SGD outperformed other algorithms. All of the other ensembles (Extra Tree, RF), linear models (LR, Linear-SVC), and MLP performed admirably, with relatively high precision, recall, accuracies and F1-Score.
Title: An Optimized Approach For Detection and Classification of Spam Email’s Using Ensemble Methods
Description:
Abstract Since the advent of email services, spam emails are a major concern because users’ security depends on the classification of emails as ham or spam.
It’s a malware attack that has been used for spear phishing, whaling, clone phishing, website forgery, and other harmful activities.
However, various ensemble Machine Learning (ML) algorithms used for the detection and filtering of spam emails have been less explored.
In this research, we offer a ML based optimized algorithm for detecting spam emails that have been enhanced using Hyper-parameter tuning approaches.
The proposed approach uses two feature extraction modules, namely Count-Vectorizer and TFIDF-Vectorizer that provide the most effective classification results when we applied them to three different publicly available email data sets: Ling Spam, UCI SMS Spam, and Proposed dataset.
Moreover, to extend the performance of classifiers we used various ML methods such as Naive Bayes (NB), Logistic Regression (LR), Extra Tree, Stochastic Gradient Descent (SGD), XG-Boost, Support Vector Machine (SVM), Random Forest (RF), Multi Layer Perception (MLP), and parameter optimization approaches such as Manual search, Random search, Grid search, and Genetic algorithm.
For all three data sets, the SGD outperformed other algorithms.
All of the other ensembles (Extra Tree, RF), linear models (LR, Linear-SVC), and MLP performed admirably, with relatively high precision, recall, accuracies and F1-Score.

Related Results

Spam Review Detection Techniques: A Systematic Literature Review
Spam Review Detection Techniques: A Systematic Literature Review
Online reviews about the purchase of products or services provided have become the main source of users’ opinions. In order to gain profit or fame, usually spam reviews are written...
Research of Email Classification based on Deep Neural Network
Research of Email Classification based on Deep Neural Network
Abstract The effective distinction between normal email and spam, so as to maximize the possible of filtering spam has become a research hotspot currently. Naive bay...
A Collaborative Reputation-Based Vector Space Model for Email Spam Filtering
A Collaborative Reputation-Based Vector Space Model for Email Spam Filtering
In this paper, we propose a novel Collaborative Reputation-based Vector Space Model (CRVSM) for detection of spam email. CRVSM uses a vector space model for representing the featur...
A Generalized Two-Level Ensemble Method for Spam Mail Detection
A Generalized Two-Level Ensemble Method for Spam Mail Detection
Email is the most cost-effective way to communicate with people across the world.  It offers a simple and convenient way to send and receive messages. However, it is susceptible to...
Feature Selection based on Improved Differential Evolution (DE) Algorithm for E-mail Classification
Feature Selection based on Improved Differential Evolution (DE) Algorithm for E-mail Classification
Spam e-mail has become a pervasive nuisance in today's digital world, posing significant challenges to efficient communication and information dissemination. Dealing with huge amou...
ENHANCED ENSEMBLE CLASSIFICATION TECHNIQUES FOR ACCURATE SPAM DETECTION IN E-MAIL COMMUNICATIONS
ENHANCED ENSEMBLE CLASSIFICATION TECHNIQUES FOR ACCURATE SPAM DETECTION IN E-MAIL COMMUNICATIONS
The exponential rise in email usage has paralleled an increase in unsolicited spam messages, posing significant threats such as phishing, malware dissemination, and personal data b...
Email Spam Detection Using Machine Learning
Email Spam Detection Using Machine Learning
Email Spam has become a major problem nowadays, with Rapid growth of internet users, Email spams is also increasing. People are using them for illegal and unethical conducts, phish...
Towards a Reliable Spam Detection: An Ensemble classification with rejection option
Towards a Reliable Spam Detection: An Ensemble classification with rejection option
Abstract Many issues are faced in the email environment due to Spam, such as bottlenecks in the email gateways despite substantial investments in servers' infrastructure, w...

Back to Top