What question did this study set out to answer?

The aim is to enhance the detection of AI-generated text by leveraging advanced language models for better accuracy.

March 28, 2026Open Access

LLM-generated text detection: enhancing accuracy using XLM-RoBERTa & DistilBERT model

Key Points

The aim is to enhance the detection of AI-generated text by leveraging advanced language models for better accuracy.
Utilized DistilBERT and XLM-RoBERTa to extract deep semantic features from text.
Employed the Extreme Gradient Boosting (XGBoost) algorithm for classification.
Evaluated models on multiple English datasets, including Daigt-V4 and LLM-Detect AI-Generated Text.
Applied a layer-weighting mechanism for multilingual evaluation on the Urdu Human and AI Text dataset.
XLM-RoBERTa achieved 94% accuracy, while DistilBERT achieved 96% on the Daigt-V4 dataset.
Both models reached 99% accuracy on the LLM-Detect AI-Generated Text dataset.
The layer-weighting mechanism enhanced performance, achieving 85% accuracy on the Urdu dataset.

Abstract

Large Language Models (LLMs) have advanced rapidly, driving major progress in Natural Language Processing (NLP) tasks. However, this advancement has also raised significant concerns about its potential misuse, particularly in academic and research contexts, such as the production of unoriginal or fabricated content. To address this challenge, we propose a robust approach for detecting artificial intelligence (AI)-generated text that emphasizes a deep understanding of linguistic context. Our methodology leverages cutting-edge language models, specifically Distilled Bidirectional Encoder Representations from Transformers (DistilBERT) and Cross-lingual Language Model-Robustly Optimized BERT Pretraining Approach (XLM-RoBERTa), to extract deep semantic features from text. These features capture intricate linguistic nuances, including contextual cues, stylistic patterns, and semantic relationships, which are critical for accurately differentiating human-written text from machine-generated text (MGT). We employ the Extreme Gradient Boosting (XGBoost) algorithm to enhance classification accuracy, a powerful machine learning technique renowned for its efficiency and predictive capability. We evaluated the proposed approach on two extensive English datasets, Daigt-V4 and LLM-Detect AI-Generated Text, extracting features primarily from the uppermost transformer layers that capture high-level semantic information. Similarly, for multilingual evaluation using XLM-RoBERTa on the Urdu Human and AI Text (UHAT) dataset, we applied a layer-weighting mechanism that combines representations from all transformer layers. This mechanism assigns trainable weights to each layer’s output, enabling the model to balance low-level syntactic and high-level semantic patterns, thereby enhancing cross-lingual robustness. Our experiments showed that DistilBERT performed well in comparison with XLM-RoBERTa by an average of 2% on the Daigt-V4 dataset. Specifically, XLM-RoBERTa achieved 94% accuracy, while DistilBERT reached 96% accuracy on the same dataset. On the LLM-Detect AI-Generated Text dataset, both models achieved 99% accuracy. In contrast, on the UHAT dataset, the model achieved a promising accuracy of 85%, demonstrating the effectiveness of the layer-weighting mechanism in handling cross-lingual challenges.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Akabra Javed

Zakia Jalil

Muhammad Nasir

Journals

PeerJ Computer Science

Actions

Institutions

Umm al-Qura University

International Islamic University, Islamabad

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

LLM-generated text detection: enhancing accuracy using XLM-RoBERTa & DistilBERT model

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study