Javascript must be enabled to continue!
VNSED: Vietnamese spam email detection using multi deep learning models
View through CrossRef
Email is one of the most popular communication methods today. However, a high percentage of spam emails are used for various purposes. Therefore, detecting spam emails and proposing solutions to limit spam are necessary. There are many current studies related to spam detection. Deep learning models have been utilized in numerous related studies to detect spam and achieve high accuracy. However, these deep learning models are mostly trained on English datasets. Adding test datasets for Vietnamese spam emails is essential to build spam detection models not only in English but also in Vietnamese. This study presents the construction of a Vietnamese spam email dataset and the proposed system named VNSED, which uses deep learning models including CNN (Convolutional Neural Network), BiLSTM (Bidirectional Long Short-Term Memory), and PhoBERT to detect Vietnamese spam email. Experimental results show that these deep learning models all achieve high accuracy in detecting Vietnamese spam emails. Specifically, the accuracy of the models are CNN: 88.42%, BiLSTM: 83.03%, and PhoBERT: 86.47%, respectively.
Publishing House for Science and Technology, Vietnam Academy of Science and Technology (Publications)
Title: VNSED: Vietnamese spam email detection using multi deep learning models
Description:
Email is one of the most popular communication methods today.
However, a high percentage of spam emails are used for various purposes.
Therefore, detecting spam emails and proposing solutions to limit spam are necessary.
There are many current studies related to spam detection.
Deep learning models have been utilized in numerous related studies to detect spam and achieve high accuracy.
However, these deep learning models are mostly trained on English datasets.
Adding test datasets for Vietnamese spam emails is essential to build spam detection models not only in English but also in Vietnamese.
This study presents the construction of a Vietnamese spam email dataset and the proposed system named VNSED, which uses deep learning models including CNN (Convolutional Neural Network), BiLSTM (Bidirectional Long Short-Term Memory), and PhoBERT to detect Vietnamese spam email.
Experimental results show that these deep learning models all achieve high accuracy in detecting Vietnamese spam emails.
Specifically, the accuracy of the models are CNN: 88.
42%, BiLSTM: 83.
03%, and PhoBERT: 86.
47%, respectively.
Related Results
Spam Review Detection Techniques: A Systematic Literature Review
Spam Review Detection Techniques: A Systematic Literature Review
Online reviews about the purchase of products or services provided have become the main source of users’ opinions. In order to gain profit or fame, usually spam reviews are written...
Perbandingan Kinerja Algoritma Naïve Bayes Dan C.45 Dalam Klasifikasi Spam Email
Perbandingan Kinerja Algoritma Naïve Bayes Dan C.45 Dalam Klasifikasi Spam Email
Antispam dengan algoritma tertentu yang dapat memisahkan antara spam-mail dengan non spam mail. Perbandingan kinerja antara algoritma naïve bayes, dan decision tree yang memakai al...
Research of Email Classification based on Deep Neural Network
Research of Email Classification based on Deep Neural Network
Abstract
The effective distinction between normal email and spam, so as to maximize the possible of filtering spam has become a research hotspot currently. Naive ...
The determinants of consumer behavior towards email advertisement
The determinants of consumer behavior towards email advertisement
PurposeThe aim of this study was to develop a theoretical model of email advertising effectiveness and to investigate differences between permission‐based email and spamming. By ex...
A Collaborative Reputation-Based Vector Space Model for Email Spam Filtering
A Collaborative Reputation-Based Vector Space Model for Email Spam Filtering
In this paper, we propose a novel Collaborative Reputation-based Vector Space Model (CRVSM) for detection of spam email. CRVSM uses a vector space model for representing the featur...
Email Spam Classifier
Email Spam Classifier
Communication plays a major part in everything be it proficient or individual. Because of its widespread use, accessibility, affordability, and free services, email is a popular co...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Analysis of the Application of Machine Learning Algorithm in Spam Detection System: Literature Review
Analysis of the Application of Machine Learning Algorithm in Spam Detection System: Literature Review
Spam detection is an evolving issue in line with the increasing volume of data and the evolution of spam techniques. In recent years, the application of machine learning (ML) algor...

