
ENHANCED ENSEMBLE CLASSIFICATION TECHNIQUES FOR ACCURATE SPAM DETECTION IN E-MAIL COMMUNICATIONS

Email usage has grown exponentially, and unsolicited spam has grown with it, bringing threats such as phishing, malware dissemination, and personal data breaches. Accurate spam detection is therefore crucial for protecting users and keeping communication efficient. Although many machine learning approaches exist, single classifiers often fail to generalize across diverse email datasets because they overfit or lack robustness. Ensemble learning, which combines multiple models, can improve spam detection rates and reduce false positives. This study proposes a hybrid ensemble classification framework that incorporates Bagging, Boosting (AdaBoost), and Voting to classify email messages as spam or ham (non-spam). A preprocessed dataset is vectorized using TF-IDF, and several base classifiers, including Decision Trees, Naive Bayes, and Support Vector Machines, are trained on it. Ensemble strategies then combine their predictions through majority voting and weighted aggregation. The proposed ensemble model significantly outperforms standalone classifiers in accuracy, precision, recall, and F1-score. Experimental evaluations on the widely used SpamAssassin and Enron datasets show consistent improvements, with the Voting ensemble achieving up to 96.8% accuracy and lower false positive rates than existing methods.
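The pipeline described in the abstract (TF-IDF features, Decision Tree, Naive Bayes, and SVM base learners, then Bagging, AdaBoost, and weighted majority Voting) can be sketched with scikit-learn. This is a minimal illustration under stated assumptions, not the authors' implementation: the toy messages, the vote weights, the hyperparameters, and the function name `build_voting_ensemble` are all hypothetical, and the tiny corpus only stands in for SpamAssassin or Enron.

```python
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier, VotingClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score, confusion_matrix, precision_recall_fscore_support
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy corpus (NOT the paper's data): label 1 = spam, 0 = ham.
train_texts = [
    "win a free prize now click here",
    "cheap loans approved instantly apply now",
    "claim your free lottery winnings today",
    "urgent verify your account password now",
    "exclusive offer buy cheap pills online",
    "congratulations you won a free cruise",
    "meeting moved to three pm tomorrow",
    "please review the attached project report",
    "lunch on friday with the team",
    "can you send me the quarterly figures",
    "thanks for the notes from the call",
    "the build passed and is ready for review",
]
train_labels = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0]


def build_voting_ensemble():
    """TF-IDF followed by a weighted hard-voting ensemble (assumed configuration)."""
    # Bagging: decision trees trained on bootstrap samples of the training set.
    bagged_trees = BaggingClassifier(DecisionTreeClassifier(), n_estimators=10, random_state=0)
    # Boosting: AdaBoost over depth-1 trees (decision stumps).
    boosted_stumps = AdaBoostClassifier(
        DecisionTreeClassifier(max_depth=1), n_estimators=25, random_state=0
    )
    voter = VotingClassifier(
        estimators=[
            ("bagging", bagged_trees),
            ("adaboost", boosted_stumps),
            ("nb", MultinomialNB()),
            ("svm", LinearSVC()),
        ],
        voting="hard",         # majority vote; LinearSVC exposes no probabilities
        weights=[1, 1, 1, 2],  # illustrative weights for the weighted aggregation
    )
    return make_pipeline(TfidfVectorizer(), voter)


model = build_voting_ensemble()
model.fit(train_texts, train_labels)

# Held-out toy messages, again hypothetical.
eval_texts = [
    "free prize click now",
    "apply now for cheap loans",
    "project report for the meeting",
    "send me the notes from friday",
]
eval_labels = [1, 1, 0, 0]
predictions = model.predict(eval_texts)

# The metrics named in the abstract, plus the false positive rate.
accuracy = accuracy_score(eval_labels, predictions)
precision, recall, f1, _ = precision_recall_fscore_support(
    eval_labels, predictions, average="binary", zero_division=0
)
tn, fp, fn, tp = confusion_matrix(eval_labels, predictions, labels=[0, 1]).ravel()
false_positive_rate = fp / (fp + tn)
print(accuracy, precision, recall, f1, false_positive_rate)
```

Hard voting is used here because `LinearSVC` does not provide class probabilities; a soft-voting variant would need probability-capable base learners. Scores on a corpus this small say nothing about the accuracy reported in the abstract.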

Related Results

Spam Review Detection Techniques: A Systematic Literature Review
Online reviews about the purchase of products or services provided have become the main source of users’ opinions. In order to gain profit or fame, usually spam reviews are written...
Rezensionen
Gerd Althoff, „Selig sind, die Verfolgung ausüben“. Päpste und Gewalt im Hochmittelalter. Stuttgart, Konrad Theiss Verlag 2013, 254 S. (Wendelin Knoch: Hattingen, E-Mail: wendelin....
Rezensionen
Michael Altripp, Die Basilika in Byzanz (Lutz Rickelt: Münster, E-Mail: l.rickelt@uni-muenster.de)Klaus Böldl, Götter und Mythen des Nordens (Michael Dallapiazza: Prato/Urbino, E-M...
Feature Selection based on Improved Differential Evolution (DE) Algorithm for E-mail Classification
Spam e-mail has become a pervasive nuisance in today's digital world, posing significant challenges to efficient communication and information dissemination. Dealing with huge amou...
A Collaborative Reputation-Based Vector Space Model for Email Spam Filtering
In this paper, we propose a novel Collaborative Reputation-based Vector Space Model (CRVSM) for detection of spam email. CRVSM uses a vector space model for representing the featur...
A Generalized Two-Level Ensemble Method for Spam Mail Detection
Email is the most cost-effective way to communicate with people across the world. It offers a simple and convenient way to send and receive messages. However, it is susceptible to...
An Optimized Approach For Detection and Classification of Spam Email’s Using Ensemble Methods
Since the advent of email services, spam emails are a major concern because users’ security depends on the classification of emails as ham or spam. It’s a malware ...
Research of Email Classification based on Deep Neural Network
The effective distinction between normal email and spam, so as to maximize the possible of filtering spam has become a research hotspot currently. Naive bay...
