Javascript must be enabled to continue!
Assessing Sarcasm Dataset Quality
View through CrossRef
Abstract
Artificial intelligence (AI) models depend on high-quality data to maintain accuracy and ensure safe deployment. However, the presence of sarcasm in sentiment analysis (SA) poses a unique challenge due to its inherently ambiguous and context-dependent nature, significantly impacting model performance. In this context, sarcasm detection plays a pivotal role in improving SA accuracy. While significant effort has been exerted, most existing sarcasm detection systems face substantial challenges due to poorly annotated datasets and the inherently complex nature of sarcastic language. To address this, we evaluate sarcasm data quality by benchmarking uniformly parameterized models across four distinct datasets: SARC, SemEval2022, NewsHeadline, and Multimodal. We conduct extensive evaluations using a three-model hierarchy: statistical machine learning, deep learning, and transfer learning models, alongside TF-IDF vectorization and word embeddings for text representation.To mitigate bias arising from class imbalance and unequal data distribution, we applied two resampling techniques—oversampling and undersampling—before conducting our experiments. Our findings reveal that the NewsHeadline dataset achieves superior performance, with RoBERTa attaining an F1-score of 0.93. Based on these insights, we compile and release a refined Sarcasm-Quality (SQ) dataset to advance future research in sarcasm-aware NLP systems.
Title: Assessing Sarcasm Dataset Quality
Description:
Abstract
Artificial intelligence (AI) models depend on high-quality data to maintain accuracy and ensure safe deployment.
However, the presence of sarcasm in sentiment analysis (SA) poses a unique challenge due to its inherently ambiguous and context-dependent nature, significantly impacting model performance.
In this context, sarcasm detection plays a pivotal role in improving SA accuracy.
While significant effort has been exerted, most existing sarcasm detection systems face substantial challenges due to poorly annotated datasets and the inherently complex nature of sarcastic language.
To address this, we evaluate sarcasm data quality by benchmarking uniformly parameterized models across four distinct datasets: SARC, SemEval2022, NewsHeadline, and Multimodal.
We conduct extensive evaluations using a three-model hierarchy: statistical machine learning, deep learning, and transfer learning models, alongside TF-IDF vectorization and word embeddings for text representation.
To mitigate bias arising from class imbalance and unequal data distribution, we applied two resampling techniques—oversampling and undersampling—before conducting our experiments.
Our findings reveal that the NewsHeadline dataset achieves superior performance, with RoBERTa attaining an F1-score of 0.
93.
Based on these insights, we compile and release a refined Sarcasm-Quality (SQ) dataset to advance future research in sarcasm-aware NLP systems.
Related Results
Sarcasm Types in Meghan Trainor’s Song Entitled “Mother”
Sarcasm Types in Meghan Trainor’s Song Entitled “Mother”
The research aim is to figure out types of sarcasm used in Meghan Trainor’s song entitled “Mother”. The descriptive qualitative method is used in this research. In analyzing the da...
Sarcasm in Iraqi Political Interviews
Sarcasm in Iraqi Political Interviews
Quintilian defined the standard view of sarcasm, or verbal irony, as speech in which we comprehend something that is the complete opposite of what is said. However, This study aime...
Sarcasm Detection Algorithms
Sarcasm Detection Algorithms
In this paper, we want to review one of the challenging problems for the opinion mining task, which is sarcasm detection. To be able to do that, many researchers tried to explore s...
Automatic sarcasm detection in Arabic tweets: resources and approaches
Automatic sarcasm detection in Arabic tweets: resources and approaches
Sentiment analysis has become a prevalent issue in the research community, with researchers employing data mining and artificial intelligence approaches to extract insights from te...
Sarcasm Detection: A Comparative Analysis of RoBERTa-CNN vs RoBERTa-RNN Architectures
Sarcasm Detection: A Comparative Analysis of RoBERTa-CNN vs RoBERTa-RNN Architectures
Increasingly advanced technology and the creation of social media and the internet can become a forum for people to express things or opinions. However, comments or views from user...
SARCASM Classifier
SARCASM Classifier
Sarcasm is a form of verbal irony where the intended meaning of a statement differs from its literal meaning. Detecting sarcasm is crucial for understanding sentiments and opinions...
Sarcasm Detection in News Headline Dataset with Ensemble Deep Learning Method
Sarcasm Detection in News Headline Dataset with Ensemble Deep Learning Method
Sarcasm, a prevalent linguistic device, is frequently used in public discourse, often causing offence and distress to the listener. The complexity inherent in detecting sarcasm is ...
Analisis Bahasa Sarkasme Pada Komentar Netizen Terhadap Pemberitaan Rohingya Di Akun Tiktok Tribun Bogor
Analisis Bahasa Sarkasme Pada Komentar Netizen Terhadap Pemberitaan Rohingya Di Akun Tiktok Tribun Bogor
In an era of ever-growing technological advances, social media has become an increasingly important communication platform for society. With the ability to send messages that only ...

