What question did this study set out to answer?

February 17, 2026 | Open Access

Target-Aware Bilingual Stance Detection in Social Media Using Transformer Architecture

Key Points

Aimed to develop a reliable, explainable, target-aware bilingual stance detection framework that generalizes across languages and heterogeneous stance formats.
Developed a dual-encoder architecture based on mDeBERTa-v3.
Implemented cross-language contrastive learning for aligning English and Arabic stance representations.
Applied robustness-oriented regularization to address noisy user-generated text.
Incorporated token-level rationale extraction for enhanced interpretability.
Achieved accuracies of 85.0% on the MT-CSD benchmark and 86.8% on the AraStance benchmark, both evaluated zero-shot.
Achieved F1-scores of 84.7% and 86.8% on the same two benchmarks, respectively, indicating stable performance.
Confirmed effective generalization across languages without retraining on a dataset specific to the target language.

Abstract

Stance detection has emerged as an essential tool in natural language processing for understanding how individuals express agreement, disagreement, or neutrality toward specific targets in social and online discourse. It plays a crucial role in bilingual and multilingual environments, including English-Arabic social media ecosystems, where differences in language structure, discourse style, and data availability pose significant challenges for reliable stance modelling. Existing approaches often struggle with target awareness, cross-lingual generalization, robustness to noisy user-generated text, and the interpretability of model decisions. This study aims to build a reliable, explainable, target-aware bilingual stance-detection framework that generalizes across heterogeneous stance formats and languages without retraining on a dataset specific to the target language. To this end, a unified dual-encoder architecture based on mDeBERTa-v3 is proposed. Cross-language contrastive learning serves as an auxiliary training objective that aligns English and Arabic stance representations in a common semantic space. Robustness-oriented regularization mitigates the effects of informal language, vocabulary variation, and adversarial noise. To promote transparency and trustworthiness, the framework incorporates token-level rationale extraction, which enables fine-grained interpretability and supports analysis of hallucination. The proposed model is tested on a combined bilingual test set and on two structurally distinct zero-shot benchmarks, MT-CSD and AraStance. Experimental results show consistent performance, with accuracies of 85.0% and 86.8% and F1-scores of 84.7% and 86.8% on MT-CSD and AraStance, respectively, indicating stable performance and realistic generalization. These findings suggest that effective bilingual stance detection can be achieved through explicit target conditioning, cross-lingual alignment, and explainability-driven design.
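
To make the cross-language contrastive objective concrete, the following is a minimal sketch in plain Python. The abstract does not state which contrastive loss the study uses, so the symmetric InfoNCE form, the temperature value of 0.07, and the names `info_nce` and `symmetric_contrastive_loss` are all assumptions chosen for illustration, not the authors' implementation. The toy vectors stand in for encoder outputs of parallel English and Arabic texts.

```python
import math


def l2_normalize(vec):
    """Scale a non-zero vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def info_nce(anchors, positives, temperature=0.07):
    """Illustrative InfoNCE loss: each anchor should match the positive at the same index.

    All other positives in the batch act as negatives. The temperature is an
    assumed hyperparameter, not a value reported by the study.
    """
    a = [l2_normalize(v) for v in anchors]
    p = [l2_normalize(v) for v in positives]
    total = 0.0
    for i, anchor in enumerate(a):
        logits = [sum(x * y for x, y in zip(anchor, cand)) / temperature for cand in p]
        # Numerically stable log-sum-exp over the batch.
        top = max(logits)
        log_denominator = top + math.log(sum(math.exp(l - top) for l in logits))
        # Cross-entropy with the matching index as the correct class.
        total += log_denominator - logits[i]
    return total / len(a)


def symmetric_contrastive_loss(english, arabic, temperature=0.07):
    """Average the English-to-Arabic and Arabic-to-English directions (assumed design)."""
    return 0.5 * (
        info_nce(english, arabic, temperature)
        + info_nce(arabic, english, temperature)
    )


# Toy embeddings (hypothetical): row i of each list represents the same stance example.
english = [[1.0, 0.0], [0.0, 1.0]]
arabic_aligned = [[0.9, 0.1], [0.1, 0.9]]
arabic_swapped = [[0.1, 0.9], [0.9, 0.1]]

aligned_loss = symmetric_contrastive_loss(english, arabic_aligned)
swapped_loss = symmetric_contrastive_loss(english, arabic_swapped)
print(f"aligned pairs: {aligned_loss:.4f}")
print(f"swapped pairs: {swapped_loss:.4f}")
```

In this sketch the loss is small when each English vector lies closest to its own Arabic counterpart and large when the pairs are swapped, which is the behavior an alignment objective of this kind is meant to reward. In a training setup it would typically be added to the stance classification loss with a weighting factor, which is likewise an assumption here.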
