What type of study is this?

This is a Cohort Study (also classified as: Quantitative Study).

October 15, 2025

Assessing Sarcasm Dataset Quality

Key Points

The NewsHeadline dataset achieved the highest F1-score of 0.93, indicating its superior quality for sarcasm detection.
Models utilized for dataset evaluation included statistical machine learning, deep learning, and transfer learning approaches.
To combat class imbalance, two resampling techniques—oversampling and undersampling—were applied during model training.
A refined Sarcasm-Quality dataset was released to support future research in sarcasm-aware natural language processing systems.
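The two resampling techniques named in the key points can be sketched as follows. This is a minimal illustration that assumes plain random resampling; the summary does not say which variant the authors used, and the function names (random_oversample, random_undersample) and the toy headlines are hypothetical.

```python
import random
from collections import Counter, defaultdict


def _group_by_label(texts, labels):
    """Collect the texts belonging to each class label."""
    groups = defaultdict(list)
    for text, label in zip(texts, labels):
        groups[label].append(text)
    return groups


def random_oversample(texts, labels, seed=0):
    """Duplicate randomly chosen examples until every class matches the largest one."""
    rng = random.Random(seed)
    groups = _group_by_label(texts, labels)
    target = max(len(items) for items in groups.values())
    pairs = []
    for label, items in groups.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        pairs.extend((text, label) for text in items + extra)
    rng.shuffle(pairs)
    return [t for t, _ in pairs], [l for _, l in pairs]


def random_undersample(texts, labels, seed=0):
    """Drop randomly chosen examples until every class matches the smallest one."""
    rng = random.Random(seed)
    groups = _group_by_label(texts, labels)
    target = min(len(items) for items in groups.values())
    pairs = []
    for label, items in groups.items():
        pairs.extend((text, label) for text in rng.sample(items, target))
    rng.shuffle(pairs)
    return [t for t, _ in pairs], [l for _, l in pairs]


# Toy, made-up data: one sarcastic (1) and four non-sarcastic (0) headlines.
texts = ["oh great, another monday", "markets close higher",
         "council approves budget", "rain expected friday", "team wins final"]
labels = [1, 0, 0, 0, 0]

over_texts, over_labels = random_oversample(texts, labels)
under_texts, under_labels = random_undersample(texts, labels)
print(Counter(over_labels), Counter(under_labels))
```

Resampling of this kind is normally applied to the training split only, so that duplicated or discarded examples do not distort the evaluation data.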

Abstract

Artificial intelligence (AI) models depend on high-quality data to maintain accuracy and ensure safe deployment. However, the presence of sarcasm in sentiment analysis (SA) poses a unique challenge due to its inherently ambiguous and context-dependent nature, significantly impacting model performance. In this context, sarcasm detection plays a pivotal role in improving SA accuracy. While significant effort has been exerted, most existing sarcasm detection systems face substantial challenges due to poorly annotated datasets and the inherently complex nature of sarcastic language. To address this, we evaluate sarcasm data quality by benchmarking uniformly parameterized models across four distinct datasets: SARC, SemEval2022, NewsHeadline, and Multimodal. We conduct extensive evaluations using a three-model hierarchy: statistical machine learning, deep learning, and transfer learning models, alongside TF-IDF vectorization and word embeddings for text representation. To mitigate bias arising from class imbalance and unequal data distribution, we applied two resampling techniques, oversampling and undersampling, before conducting our experiments. Our findings reveal that the NewsHeadline dataset achieves superior performance, with RoBERTa attaining an F1-score of 0.93. Based on these insights, we compile and release a refined Sarcasm-Quality (SQ) dataset to advance future research in sarcasm-aware NLP systems.
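For readers unfamiliar with the metric reported above, the F1-score is the harmonic mean of precision and recall. The sketch below computes it for a single positive class; the abstract does not state which averaging scheme (binary, macro, or weighted) lies behind the 0.93 figure, so the binary form, the function name f1_score_binary, and the toy labels are assumptions made purely for illustration.

```python
def f1_score_binary(y_true, y_pred, positive=1):
    """F1 for one positive class: 2 * precision * recall / (precision + recall)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # true positives
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false positives
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # false negatives
    if tp == 0:
        # No correct positive predictions: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up labels: 1 = sarcastic, 0 = not sarcastic.
y_true = [1, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 0]
score = f1_score_binary(y_true, y_pred)
print(round(score, 3))
```

Because F1 ignores true negatives, it is less flattering than plain accuracy on imbalanced data, which is one reason it is commonly reported for sarcasm detection.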

Cite this study

Bade et al. studied this question.

www.synapsesocial.com/papers/68f01110f081da0584b56a1a — DOI: https://doi.org/10.21203/rs.3.rs-7541663/v1

Authors

Girma Yohannis Bade

Olga Kolesnikova

José Luis Oropeza

Institutions

Instituto Politécnico Nacional
