Purpose Text adversarial techniques are a key research area in natural language processing (NLP). With the widespread application of deep learning-driven NLP in sentiment analysis and machine translation, the lack of robustness against adversarial attacks has become increasingly evident. Adversarial samples in the text domain can mislead models and the discrete nature of text distinguishes adversarial techniques in this field from others. To outline recent progress, this paper aims to review relevant studies from the past decade, summarizing advancements in adversarial attack, defense and detection methods. Design/methodology/approach This paper conducts a systematic literature review of text adversarial attacks and defenses. The review begins with an analysis of attack methods, then discusses text adversarial defense and detection methods and finally points out the challenges in both attack and defense. Findings The authors provide a clear classification of attack methods based on specific criteria, followed by an overview of defense and detection techniques, highlighting their respective strengths and limitations. Originality/value This review systematically classifies and analyzes text adversarial attack, defense and detection techniques and thoroughly explores the challenges and future development directions in this field, providing valuable reference information for researchers in this area.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhang et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69df2c50e4eeef8a2a6b157a — DOI: https://doi.org/10.1108/ijwis-10-2025-0359
Guojun Zhang
Qiaolong Ding
Lihong Wang
International Journal of Web Information Systems
Sichuan University
University of Electronic Science and Technology of China
Chengdu University
Building similarity graph...
Analyzing shared references across papers
Loading...