Javascript must be enabled to continue!
Research of Email Classification based on Deep Neural Network
View through CrossRef
Abstract
The effective distinction between normal email and spam, so as to maximize the possible of filtering spam has become a research hotspot currently. Naive bayes algorithm is a kind of frequently-used email classification and it is a statistical-based classification algorithm. It assumes that the attributes are independent of each other when given the target value. This hypothesis is apparently impossible in the email classification, so the accuracy of email classification based on naive bayes algorithm is low. In allusion to the problem of poor accuracy of email classification based on naive bayes algorithm, scholars have proposed some new email classification algorithms. The email classification algorithm based on deep neural network is one kind of them. The deep neural network is an artificial neural network with full connection between layer and layer. The algorithm extracted the email feature from the training email samples and constructed a DNN with multiple hidden layers, the DNN classifier was generated by training samples, and finally the testing emails were classified, and they were marked whether they were spam or not. In order to verify the effect of the email classification algorithm based on DNN, in this paper we constructed a DNN with 2 hidden layers. The number of nodes in each hidden layer was 30. When the training set was trained, we set up 2000 batches, and each batch has 3 trained data. We used the famous Spam Base dataset as the data set. The experiment result showed that DNN was higher than naive Bayes in the accuracy of email classification when the proportion of the training set was 10%, 20%, 30%, 40% and 50% respectively, and DNN showed a good classification effect. With the development of science and technology, spam manifests in many forms and the damage of it is more serious, this puts forward higher requirements for the accuracy of spam recognition. The focus of next research will be combining various algorithms to further improve the effect of email classification.
Walter de Gruyter GmbH
Title: Research of Email Classification based on Deep Neural Network
Description:
Abstract
The effective distinction between normal email and spam, so as to maximize the possible of filtering spam has become a research hotspot currently.
Naive bayes algorithm is a kind of frequently-used email classification and it is a statistical-based classification algorithm.
It assumes that the attributes are independent of each other when given the target value.
This hypothesis is apparently impossible in the email classification, so the accuracy of email classification based on naive bayes algorithm is low.
In allusion to the problem of poor accuracy of email classification based on naive bayes algorithm, scholars have proposed some new email classification algorithms.
The email classification algorithm based on deep neural network is one kind of them.
The deep neural network is an artificial neural network with full connection between layer and layer.
The algorithm extracted the email feature from the training email samples and constructed a DNN with multiple hidden layers, the DNN classifier was generated by training samples, and finally the testing emails were classified, and they were marked whether they were spam or not.
In order to verify the effect of the email classification algorithm based on DNN, in this paper we constructed a DNN with 2 hidden layers.
The number of nodes in each hidden layer was 30.
When the training set was trained, we set up 2000 batches, and each batch has 3 trained data.
We used the famous Spam Base dataset as the data set.
The experiment result showed that DNN was higher than naive Bayes in the accuracy of email classification when the proportion of the training set was 10%, 20%, 30%, 40% and 50% respectively, and DNN showed a good classification effect.
With the development of science and technology, spam manifests in many forms and the damage of it is more serious, this puts forward higher requirements for the accuracy of spam recognition.
The focus of next research will be combining various algorithms to further improve the effect of email classification.
Related Results
Deep convolutional neural network and IoT technology for healthcare
Deep convolutional neural network and IoT technology for healthcare
Background Deep Learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find complex patterns in ...
User Motive Factors: The Rudiments to Email Satisfactory Communication
User Motive Factors: The Rudiments to Email Satisfactory Communication
Our study aims to evaluate if user motive factors, comprising; cognitive needs, social integrative needs, and personal integrative needs might influence email satisfactory communic...
Neural Networks for Quality Sorting of Agricultural Produce
Neural Networks for Quality Sorting of Agricultural Produce
The objectives of this project were to develop procedures and models, based on neural networks, for quality sorting of agricultural produce. Two research teams, one in Purdue Unive...
Fuzzy Chaotic Neural Networks
Fuzzy Chaotic Neural Networks
An understanding of the human brain’s local function has improved in recent years. But the cognition of human brain’s working process as a whole is still obscure. Both fuzzy logic ...
Protein Fold Classification using Graph Neural Network and Protein Topology Graph
Protein Fold Classification using Graph Neural Network and Protein Topology Graph
AbstractProtein fold classification reveals key structural information about proteins that is essential for understanding their function. While numerous approaches exist in the lit...
Detection of Phishing Threats Using Neural Networks
Detection of Phishing Threats Using Neural Networks
Today, the Internet is an effective channel for social interaction worldwide, but it also opens up great opportunities for cyberattacks. Recently, the number of botnets and phishin...
Improving Medical Document Classification via Feature Engineering
Improving Medical Document Classification via Feature Engineering
<p dir="ltr">Document classification (DC) is the task of assigning the predefined labels to unseen documents by utilizing the model trained on the available labeled documents...
Classifying Wheat Hyperspectral Pixels of Healthy Heads and Fusarium Head Blight Disease Using a Deep Neural Network in the Wild Field
Classifying Wheat Hyperspectral Pixels of Healthy Heads and Fusarium Head Blight Disease Using a Deep Neural Network in the Wild Field
Classification of healthy and diseased wheat heads in a rapid and non-destructive manner for the early diagnosis of Fusarium head blight disease research is difficult. Our work app...


