Attention-enabled Multi-layer Subword Joint Learning for Chinese Word Embedding

Abstract: In recent years, Chinese word embeddings have attracted significant attention in natural language processing (NLP). The complex structures and diverse influences of Chinese characters present distinct challenges for semantic representation. As a result, Chinese word embeddings are primarily studied together with characters and their subcomponents. Previous research has shown that word vectors frequently fail to capture the subtle semantics embedded in the complex structure of Chinese characters, and that they often neglect the varying contributions that subword information at different levels makes to semantics. To tackle these challenges, we present a weight-based word vector model that takes into account the internal structure of Chinese words at multiple levels. The model divides this internal structure into six layers of subword information: words, characters, components, pinyin, strokes, and structures. The semantics of a Chinese word are derived by integrating the subword information from these layers. Because each layer contributes differently to word semantics, the model uses an attention mechanism to determine the weights both between and within the subword layers, so that word semantics are extracted more comprehensively. The word-level subwords act as the attention query for the subwords in the other layers to learn semantic bias. Experimental results show that the proposed model improves performance on word similarity, word analogy, and text categorization tasks, and case studies support these findings.
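The abstract does not give the scoring function or the training objective, so the following is only a minimal sketch of the general idea: the word vector serves as the query, attention is applied first within each subword layer and then between the pooled layers. The dot-product scoring, the function names (`attend`, `compose_word`), and the toy vectors are all assumptions made for illustration, not the authors' implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attend(query, vectors):
    """Dot-product attention (an assumed scoring function).

    Returns the attention weights and the weighted sum of `vectors`.
    """
    weights = softmax([dot(query, v) for v in vectors])
    pooled = [
        sum(w * v[i] for w, v in zip(weights, vectors))
        for i in range(len(query))
    ]
    return weights, pooled


def compose_word(word_vec, layers):
    """Combine subword layers using the word vector as the query.

    `layers` maps a hypothetical layer name (e.g. "characters",
    "strokes") to the list of subword vectors in that layer.
    """
    names = list(layers)
    # Attention within each layer: pool its subwords into one vector.
    layer_vecs = [attend(word_vec, layers[name])[1] for name in names]
    # Attention between layers: weight the pooled layer vectors.
    layer_weights, combined = attend(word_vec, layer_vecs)
    return dict(zip(names, layer_weights)), combined


# Toy two-dimensional example with made-up vectors.
toy_word = [1.0, 0.0]
toy_layers = {
    "characters": [[1.0, 0.0], [0.0, 1.0]],
    "strokes": [[0.5, 0.5]],
}
toy_weights, toy_combined = compose_word(toy_word, toy_layers)
print(toy_weights, toy_combined)
```

In this sketch the between-layer weights sum to one, so the combined vector is a convex combination of the pooled layer vectors. A trained model would learn the embeddings and, most likely, additional projection parameters that are omitted here.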

Springer Science and Business Media LLC

Pengpeng Xue, Liang Tan, Jing Xiong, Zhongzhu Liu, Kanglong Liu

2024
