A Study on the EDA-based Classification of News Text

Abstract: Current approaches to news text classification mostly rely on word vector models such as Word2Vec, GloVe, and BERT, which ignore the long-range contextual connections in Chinese text. Neural classification models such as TextCNN, RNN, and BiLSTM do not adequately extract the important features of a text and capture only weak word dependencies within it, which leads to inaccurate classification results. To address these problems, this paper proposes a DPCNN-Attention news text classification model built on the ERNIE pre-trained model (hereafter EDA). A DPCNN network with the Mish activation function is used to capture semantic associations across the longest possible distances in news text. An attention mechanism is added to the EDA model so that, during feature extraction, words receive different weights according to their importance to the classification result; this strengthens the word dependencies within the text and improves classification accuracy. The EDA model was evaluated experimentally on the THUCNews dataset. The results show that the EDA model improved by about 6% compared with the BERT pre-trained model and by 0.4% compared with the ERNIE pre-trained model. The loss rate decreased by 0.2 compared with BERT and by 0.01 compared with ERNIE.
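The two building blocks named in the abstract, the Mish activation and attention-based weighting of words, can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the dot-product scoring, and the `query` vector (a stand-in for a learned parameter) are all assumptions chosen for illustration, and the sketch uses only the Python standard library.

```python
import math


def softplus(x: float) -> float:
    # Numerically stable form of log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def mish(x: float) -> float:
    # Mish activation: x * tanh(softplus(x)).
    return x * math.tanh(softplus(x))


def softmax(scores: list[float]) -> list[float]:
    # Subtract the maximum before exponentiating to avoid overflow.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(token_vectors: list[list[float]], query: list[float]):
    # Score each token vector by its dot product with `query`
    # (hypothetical: a real model would learn this vector).
    scores = [sum(t * q for t, q in zip(vec, query)) for vec in token_vectors]
    # Normalise the scores into weights that sum to 1.
    weights = softmax(scores)
    # Weighted sum of token vectors: more important tokens contribute more.
    dim = len(token_vectors[0])
    pooled = [
        sum(w * vec[i] for w, vec in zip(weights, token_vectors))
        for i in range(dim)
    ]
    return weights, pooled


if __name__ == "__main__":
    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    weights, pooled = attention_pool(tokens, query=[2.0, 0.0])
    print([round(mish(v), 4) for v in (-1.0, 0.0, 1.0)])
    print([round(w, 4) for w in weights], [round(p, 4) for p in pooled])
```

In this sketch the tokens that align with `query` receive larger weights, which mirrors the abstract's idea of weighting words by their importance to the classification result. A full model would apply the activation inside convolutional layers and learn the scoring parameters during training.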

Related Results

Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Easy Data Augmentation for Imbalanced Data in Online Health Consultations (Easy Data Augmentation untuk Data yang Imbalance pada Konsultasi Kesehatan Daring)
Text augmentation approaches are often used to handle data imbalance in text classification tasks, such as Online Health Consultation (Konsultasi Kesehatan Daring, KKD) texts from alodokter.com. Technique ...
Bounds on the sum of broadcast domination number and strong metric dimension of graphs
Let [Formula: see text] be a connected graph of order at least two with vertex set [Formula: see text]. For [Formula: see text], let [Formula: see text] denote the length of an [Fo...
Adherence of synovial cells on EDA‐containing fibronectin
Abstract Objective. To investigate the role of EDA‐containing fibronectin (EDA+ FN), a splice variant of FN detectable in association with cellular transformation, in the adherence ...
Ectodysplasin signaling via Xedar is required for mammary gland morphogenesis
ABSTRACT The Ectodysplasin A2 receptor (XEDAR), is a member of the tumor necrosis factor receptor subfamily and is a mediator of the Ectodysplasi...
