Javascript must be enabled to continue!

A Study on the EDA-based Classification of News Text

Abstract At present, the commonly used word vector methods for news text classification mostly adopt Word2Vec, Glove, Bert and other word vector models, ignoring the remote context connections of Chinese text itself. TextCNN, RNN, BiLSTM and other neural network classification models lack the extraction of important information features of the text, and the word dependence inside the text is not strong, resulting in inaccurate classification results. To solve the above problems, this paper proposes the DPCNN-Attention news text classification model based on ERNIE’s pre-training model(EDA for short, the following general). The DPCNN neural network model with Mish() activation function is adopted to obtain the maximum length of semantic association between long distance texts in news texts. By adding attention mechanism into EDA model, in the feature extraction process, according to the importance of words to the classification results, different weights are assigned to them to enhance the word dependence relationship within the text, thus greatly improving the classification accuracy. The EDA model was experimentally verified on the THUCNews dataset. The results showed that the EDA model improved by about 6% compared with BERT’s pre-training model, and 0.4% compared with ERNIE’s pre-training model. The loss rate decreased by 0.2 compared with BERT and 0.01 compared with ERNIE’s.

IOP Publishing

Xu Shuwei Gao Xuyang Wang Ying

Journal of Physics: Conference Series

2021

Title: A Study on the EDA-based Classification of News Text

Description:

TextCNN, RNN, BiLSTM and other neural network classification models lack the extraction of important information features of the text, and the word dependence inside the text is not strong, resulting in inaccurate classification results.

To solve the above problems, this paper proposes the DPCNN-Attention news text classification model based on ERNIE’s pre-training model(EDA for short, the following general).

The DPCNN neural network model with Mish() activation function is adopted to obtain the maximum length of semantic association between long distance texts in news texts.

By adding attention mechanism into EDA model, in the feature extraction process, according to the importance of words to the classification results, different weights are assigned to them to enhance the word dependence relationship within the text, thus greatly improving the classification accuracy.

The EDA model was experimentally verified on the THUCNews dataset.

The results showed that the EDA model improved by about 6% compared with BERT’s pre-training model, and 0.

4% compared with ERNIE’s pre-training model.

The loss rate decreased by 0.

2 compared with BERT and 0.

01 compared with ERNIE’s.

Back

<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...

Sleep Habits and Occurrence of Lowback Pain among Craftsmen

<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...

Easy Data Augmentation untuk Data yang Imbalance pada Konsultasi Kesehatan Daring

Pendekatan augmentasi teks sering digunakan untuk menangani imbalance data pada kasus klasifikasi teks, seperti teks Konsultasi Kesehatan Daring (KKD), yaitu alodokter.com. Teknik ...

Easy Data Augmentation untuk Data yang Imbalance pada Konsultasi Kesehatan Daring

Pendekatan augmentasi teks sering digunakan untuk menangani imbalance data pada kasus klasifikasi teks, seperti teks Konsultasi Kesehatan Daring (KKD), yaitu alodokter.com. Teknik ...

Asociación entre lactancia y enfermedad diarreica aguda en niños menores de 2 años, atendidos en el centro de salud de Pueblo Nuevo- Ica en enero-marzo 2023

OBJETIVO: Determinar cuál es la asociación entre lactancia y la enfermedad diarreica aguda en niños menores de 2 años, atendidos en el centro de salud de pueblo nuevo- Ica en enero...

Bounds on the sum of broadcast domination number and strong metric dimension of graphs

Let [Formula: see text] be a connected graph of order at least two with vertex set [Formula: see text]. For [Formula: see text], let [Formula: see text] denote the length of an [Fo...

Adherence of synovial cells on EDA‐containing fibronectin

AbstractObjective. To investigate the role of EDA‐containing fibronectin (EDA+ FN), a splice variant of FN detectable in association with cellular transformation, in the adherence ...

Ectodysplasin signaling via Xedar is required for mammary gland morphogenesis

ABSTRACT The Ectodysplasin A2 receptor (XEDAR), is a member of the tumor necrosis factor receptor subfamily and is a mediator of the Ectodysplasi...

Email:
Password:

Email:

A Study on the EDA-based Classification of News Text

Related Results