
Research on Weibo New Word Recognition based on Weibo Data and Statistical Information

The recognition of new words on Weibo is a key challenge in Chinese information processing, with a direct effect on downstream tasks such as machine translation and text classification. Because Weibo has become the most widely used social platform among internet users, mining new vocabulary from Weibo data helps both to understand the data itself and to support personalized recommendation services for users. Although a large body of research addresses new word recognition in general, work focused specifically on Weibo new words is still scarce. In this article, we propose a Weibo new word recognition strategy that combines Weibo content features with statistical information. First, we extract recurring vocabulary from Weibo topic names as candidate words; we then apply absolute frequency, relative frequency, mutual information, and information entropy to filter out incorrect candidates. The experimental results show that, with appropriate thresholds, incorrect candidates can be filtered out effectively, thereby improving recognition performance.
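The filtering stage named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give formulas or thresholds, so everything here is an assumption, including the function names, the use of character n-grams as candidates, relative frequency defined as count divided by total characters, mutual information taken as the lowest pointwise mutual information over all two-way splits of a candidate, information entropy taken as the entropy of the neighboring characters, and every default threshold.

```python
import math
from collections import Counter


def candidate_counts(texts, max_len=4):
    """Count every character n-gram of length 1..max_len in the texts."""
    counts = Counter()
    for text in texts:
        for n in range(1, max_len + 1):
            for i in range(len(text) - n + 1):
                counts[text[i:i + n]] += 1
    return counts


def min_pmi(word, counts, total_chars):
    """Lowest pointwise mutual information over all two-way splits of word.

    Probabilities are approximated as count / total_chars (an assumption).
    """
    p_word = counts[word] / total_chars
    scores = []
    for k in range(1, len(word)):
        p_left = counts[word[:k]] / total_chars
        p_right = counts[word[k:]] / total_chars
        scores.append(math.log2(p_word / (p_left * p_right)))
    return min(scores)


def entropy(counter):
    """Shannon entropy (base 2) of a Counter of observations."""
    total = sum(counter.values())
    if total == 0:
        return 0.0
    return -sum(v / total * math.log2(v / total) for v in counter.values())


def branching_entropy(word, texts):
    """Return (left, right) entropy of the characters adjacent to word."""
    left, right = Counter(), Counter()
    for text in texts:
        start = text.find(word)
        while start != -1:
            if start > 0:
                left[text[start - 1]] += 1
            end = start + len(word)
            if end < len(text):
                right[text[end]] += 1
            start = text.find(word, start + 1)
    return entropy(left), entropy(right)


def filter_candidates(texts, min_count=2, min_rel_freq=0.0,
                      pmi_threshold=1.0, entropy_threshold=0.5, max_len=4):
    """Keep candidates that pass all four (hypothetical) thresholds."""
    counts = candidate_counts(texts, max_len)
    total_chars = sum(len(t) for t in texts)
    kept = []
    for word, count in counts.items():
        if len(word) < 2 or count < min_count:
            continue  # absolute frequency filter
        if count / total_chars < min_rel_freq:
            continue  # relative frequency filter
        if min_pmi(word, counts, total_chars) < pmi_threshold:
            continue  # parts do not co-occur strongly enough
        if min(branching_entropy(word, texts)) < entropy_threshold:
            continue  # neighbors too predictable: likely a word fragment
        kept.append(word)
    return kept
```

Calling `filter_candidates` on a list of topic-name strings returns the candidates that survive all four filters. Low mutual information suggests the parts of a candidate co-occur by chance, and low neighbor entropy suggests the candidate is a fragment of a longer word, which is why both are used as rejection criteria in this sketch.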

Related Results

Spoken Word Recognition
The core question that spoken word recognition research attempts to address is: How does a phonological word-form activate the corresponding lexical representation that is stored i...
How does Weibo keep users hooked? A Weibo addictive behavior study based on netnography
Purpose: With the rapid development of social media in the past few years, some dark aspects of usage have appeared, e.g., Weibo addiction. Therefore, the purpose of this paper is t...
Features of China's new media (by example of the Weibo service)
This article examines the features of China's new media, which form a tapestry of diverse websites oriented toward meeting various needs ...
Trajectories of and spatial variations in HPV vaccine discussions on Weibo, 2018-2023: a deep learning analysis
Summary. Research in context. Evidence before this study: We first searched PubMed for articles published until November 2023 with the keywords “(“HPV”) AND (“Vaccine” or “Vaccination”) ...
Machine Users Detection on Sina Weibo Platform
In recent years, the rapid development of Sina Weibo has made it the representative of many Weibo platforms in China. Sina Weibo has attracted large numbers of users in China becau...
Analysis of Theme Characteristics and Emotional Tendency in Weibo Discourse of People with Autism Spectrum Disorder
Weibo serves as an important platform for the general public to obtain information about people with autism spectrum disorder. This study focused on 33 popular Weibo trends related...
The Existential and Anthropological Semantics of the Word in Late 17th-Century Sermons
This article describes the semantics of the word concept, which is represented in late 17th-century homiletic texts. It is defined by the topics of sermons in terms of their ontolo...
Identifying Links Between Latent Memory and Speech Recognition Factors
Objectives: The link between memory ability and speech recognition accuracy is often examined by correlating summary measures of performance across various tasks, but i...
