Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Development and Evaluation of Gold Standard Dataset for Sentiment Analysis of Tweets

View through CrossRef
Pre-labeled data is typically required for supervised machine learning. A limited number of object classes in the majority of open access and pre-annotated datasets make them unsuitable for certain tasks, even though they are readily available for training machine learning algorithms. For custom models, previously available pre-annotated data is typically insufficient, so gathering and preparing training data is necessary for the majority of real-world applications. The quantity and quality of annotations clearly trade-off with one another. Either more annotated data can be produced or better data quality can be guaranteed by allocating time and resources. Development of the gold standard by annotating textual information is an essential part of the “Text Analytics” domain in the field of “Natural Language Processing-NLP”. In “Text Analytics”, annotation can be done by adopting a manual, semi-automatic or automatic approach. In the case of the manual approach, annotators often work with partial parts of the corpus, and the results are generalized by automated text classification which may affect the final classification results. Annotations reliability and suitability of assigned labels are particularly important in the NLP applications related to opinion mining or sentiment analysis. In this research study, we have evaluated the significance of the annotation process on a novel dataset that contained multiple languages (English, Roman Urdu), a free text dataset that was extracted from Twitter. This unique dataset contained multiple languages which makes this annotation process essential for researching this data. Using this multi-language dataset, we examine the inter-annotator agreement in multiclass and multi-label sentiment annotation. To scrutinize the reliability of this research work, several annotation agreement metrics, statistical analysis, and Machine Learning methods have been considered to evaluate the accuracy of resulting annotations. It was observed that the annotation process is significant and a complex step that is essential for the proper implementation of Natural Language processing tasks for text analytics in machine learning. During this research, different gaps were identified and resolved which can impact the overall reliability of the annotation process which are reported in this paper. We conclude that while inaccurate annotations worsen the results, the impact is minimal, at least when using text data. The advantages of the larger annotated data set (obtained by employing subpar auto-annotation techniques) surpass the degradation resulting from the use of annotated data.
Title: Development and Evaluation of Gold Standard Dataset for Sentiment Analysis of Tweets
Description:
Pre-labeled data is typically required for supervised machine learning.
A limited number of object classes in the majority of open access and pre-annotated datasets make them unsuitable for certain tasks, even though they are readily available for training machine learning algorithms.
For custom models, previously available pre-annotated data is typically insufficient, so gathering and preparing training data is necessary for the majority of real-world applications.
The quantity and quality of annotations clearly trade-off with one another.
Either more annotated data can be produced or better data quality can be guaranteed by allocating time and resources.
Development of the gold standard by annotating textual information is an essential part of the “Text Analytics” domain in the field of “Natural Language Processing-NLP”.
In “Text Analytics”, annotation can be done by adopting a manual, semi-automatic or automatic approach.
In the case of the manual approach, annotators often work with partial parts of the corpus, and the results are generalized by automated text classification which may affect the final classification results.
Annotations reliability and suitability of assigned labels are particularly important in the NLP applications related to opinion mining or sentiment analysis.
In this research study, we have evaluated the significance of the annotation process on a novel dataset that contained multiple languages (English, Roman Urdu), a free text dataset that was extracted from Twitter.
This unique dataset contained multiple languages which makes this annotation process essential for researching this data.
Using this multi-language dataset, we examine the inter-annotator agreement in multiclass and multi-label sentiment annotation.
To scrutinize the reliability of this research work, several annotation agreement metrics, statistical analysis, and Machine Learning methods have been considered to evaluate the accuracy of resulting annotations.
It was observed that the annotation process is significant and a complex step that is essential for the proper implementation of Natural Language processing tasks for text analytics in machine learning.
During this research, different gaps were identified and resolved which can impact the overall reliability of the annotation process which are reported in this paper.
We conclude that while inaccurate annotations worsen the results, the impact is minimal, at least when using text data.
The advantages of the larger annotated data set (obtained by employing subpar auto-annotation techniques) surpass the degradation resulting from the use of annotated data.

Related Results

Sentiment Analysis of Tweets on Soda Taxes
Sentiment Analysis of Tweets on Soda Taxes
Context: As a primary source of added sugars, sugar-sweetened beverage (SSB) consumption may contribute to the obesity epidemic. A soda tax is an excise tax charged on ...
Evaluation of Medical Confidentiality Breaches on Twitter Among Anesthesiology and Intensive Care Health Care Workers
Evaluation of Medical Confidentiality Breaches on Twitter Among Anesthesiology and Intensive Care Health Care Workers
BACKGROUND: With the generalization of social network use by health care workers, we observe the emergence of breaches in medical confidentiality. Our objective was to ...
Sentiment Analysis of Russia-Ukraine Conflict Tweets Using RoBERTa
Sentiment Analysis of Russia-Ukraine Conflict Tweets Using RoBERTa
[Objective] The moment Russia officially invaded Ukraine, the world experienced a period of tension and uncertainty. As a social release valve digital communication, channels incre...
Sentiment Analysis with Python: A Hands-on Approach
Sentiment Analysis with Python: A Hands-on Approach
Sentiment Analysis is a rapidly growing field in Natural Language Processing (NLP) that aims to extract opinions, emotions, and attitudes expressed in text. It has a wide range o...
Study of the Yahoo-yahoo Hash-tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
Study of the Yahoo-yahoo Hash-tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
Abstract BackgroundSocial media opinion has become a medium to quickly access large, valuable, and rich details of information on any subject matter within a short period. ...
Study of the Yahoo-Yahoo Hash-Tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
Study of the Yahoo-Yahoo Hash-Tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
Mining opinion on social media microblogs presents opportunities to extract meaningful insight from the public from trending issues like the “yahoo-yahoo” which in Nigeria, is syno...
#Menopause: The Menopause Ontology Project
#Menopause: The Menopause Ontology Project
ABSTRACT Introduction Medical professionals and patients increasingly utilize social media to connect and share healthcare infor...
SA-MAIS: Hybrid automatic sentiment analyser for stock market
SA-MAIS: Hybrid automatic sentiment analyser for stock market
Sentiment analysis of stock-related tweets is a challenging task, not only due to the specificity of the domain but also because of the short nature of the texts. This work propose...

Back to Top