What question did this study set out to answer?

The study aims to evaluate the performance of zero-shot stance detection using a Large Language Model on social media content.

February 14, 2026 · Open Access

Zero-shot stance detection in practice: insights on training, prompting, and decoding with a capable lightweight LLM

Key Points

The study aims to evaluate the performance of zero-shot stance detection using a Large Language Model on social media content.
Used the FlanT5-XXL model for stance detection on tweets.
Analyzed performance using the SemEval 2016 Tasks 6A, 6B, and P-Stance datasets.
Explored various prompts and decoding strategies to assess model sensitivities and biases.
Conducted a qualitative analysis to identify the LLM's limitations.
The zero-shot approach can match or outperform state-of-the-art methods, including fine-tuned models.
Identified sensitivity to prompt instructions and decoding strategies.
Found a positivity bias that may partially explain performance differences across decoding strategies.
Revealed overconfidence in stance assignment in certain cases, leading to misclassifications.
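
As a rough illustration of how a label-level bias of the kind listed above might be checked, the sketch below compares how often a label is predicted with how often it appears in the gold annotations. This is not the study's procedure: the function names, the label strings, and the idea of using the share gap for the "favor" label as a proxy for positivity bias are all assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, List


def label_shares(labels: List[str]) -> Dict[str, float]:
    """Return the fraction of items carrying each label (empty input gives {})."""
    if not labels:
        return {}
    counts = Counter(labels)
    total = len(labels)
    return {label: count / total for label, count in counts.items()}


def share_gap(gold: List[str], predicted: List[str], label: str) -> float:
    """Predicted share minus gold share for one label.

    A clearly positive gap for a "favor"-style label would hint that the
    model over-predicts it. Hypothetical diagnostic, not the paper's metric.
    """
    return label_shares(predicted).get(label, 0.0) - label_shares(gold).get(label, 0.0)


# Toy data, invented for this example only.
gold = ["favor", "against", "against", "neutral"]
predicted = ["favor", "favor", "against", "favor"]
gap = share_gap(gold, predicted, "favor")
```

In this toy case the model predicts "favor" for three of four items while the gold labels contain it once, so the gap is positive. On real data, a gap like this would only be a starting point for closer inspection.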

Abstract

We investigate the performance of Large Language Model (LLM)-based zero-shot stance detection on tweets. Using FlanT5-XXL, an instruction-tuned open-source LLM, with the SemEval 2016 Tasks 6A, 6B, and P-Stance datasets, we analyze how its performance varies under different prompts and decoding strategies, as well as potential model biases. We show that the zero-shot approach can match or outperform state-of-the-art methods, including fine-tuned models. Additionally, we provide practical insights into its performance, including sensitivity to instructions and prompts, decoding strategies, prompt perplexity, and the role of negations and oppositions. We ensure that the LLM has not been trained on test datasets and identify a positivity bias that may partially explain performance differences across decoding strategies. Finally, we conduct a qualitative analysis of cases where the LLM consistently fails, uncovering questionable ground truth labels and an overconfidence in assigning a stance when none exists. In sum, we provide an in-depth case study of using an LLM for a stance detection task, which can serve as a guide for practitioners seeking to leverage LLMs for similar tasks and use cases.
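
To make the prompting and decoding choices described in the abstract more concrete, here is a minimal sketch of a zero-shot stance pipeline with the model call left out. The prompt wording, the label set, and both helper functions are assumptions for illustration; the study's actual prompts and decoding strategies are not reproduced here. The sketch contrasts two ways of turning model output into a label: parsing free-form generated text, and choosing the highest-scoring label from a fixed candidate set.

```python
from typing import Dict

# Assumed label set; the datasets in the study may use different label names.
LABELS = ["favor", "against", "neutral"]


def build_prompt(tweet: str, target: str) -> str:
    """Build a zero-shot instruction prompt (hypothetical wording)."""
    return (
        f'What is the stance of the following tweet toward "{target}"? '
        f"Answer with one of: {', '.join(LABELS)}.\n"
        f"Tweet: {tweet}\nStance:"
    )


def parse_free_text(generated: str) -> str:
    """Map free-form generated text to a label.

    Falls back to "neutral" when no label is recognized; that fallback is
    an assumption of this sketch and can itself skew results.
    """
    text = generated.strip().lower()
    for label in LABELS:
        if text.startswith(label):
            return label
    return "neutral"


def pick_constrained(label_scores: Dict[str, float]) -> str:
    """Pick the candidate label with the highest score.

    `label_scores` stands in for per-label log-likelihoods that a model
    would assign to each candidate answer given the prompt.
    """
    return max(LABELS, key=lambda label: label_scores[label])


prompt = build_prompt("We need action now, not more talk.", "Climate Change")
free_text_label = parse_free_text(" Against. ")
constrained_label = pick_constrained({"favor": -1.2, "against": -0.3, "neutral": -2.0})
```

The two routes can disagree on the same input, which is one reason a comparison across decoding strategies is informative: free-text parsing depends on how the model phrases its answer, while constrained selection always returns one of the allowed labels.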

Authors

Rachith Aiyappa

Indiana University Bloomington

Shruthi Senthilmani

Indiana University Bloomington

Jisun An

Indiana University Bloomington

Journals

PeerJ Computer Science

Institutions

University of Virginia

Indiana University Bloomington
