What question did this study set out to answer?

This research aims to review multimodal fake news detection methods and analyze their strengths and limitations.

April 21, 2026

Beyond Words and Pictures: A Survey of Multimodal Fake News Detection

Key Points

Reviews multimodal fake news detection methods and analyzes their strengths and limitations.
Groups existing methods into two paradigms: small-model-based methods and LLM-involved methods.
Analyzes representative approaches by modelling strategy, factual grounding, robustness, and deployment feasibility.
Reviews widely used datasets and evaluation protocols, with a focus on their limitations.
Identifies major gaps in current evaluation strategies and dataset composition.
Highlights the challenges of detecting LLM-generated misinformation and of generalizing across languages.
Outlines future research directions for interpretable reasoning and trustworthy evaluation.

Abstract

Multimodal fake news detection (MFND) has attracted growing attention as misinformation increasingly appears in heterogeneous forms that combine text, images, audio, video, and social context, while recent generative models further increase the realism and scalability of deceptive content. Meanwhile, the rise of large language models (LLMs) and multimodal large language models (MLLMs) has introduced new opportunities as well as new evaluation challenges for MFND. In this survey, we present a structured review of the field through a taxonomy that groups existing methods into two broad paradigms: small‐model‐based methods and LLM‐involved methods. For each paradigm, we analyse representative approaches from the perspectives of modelling strategy, factual grounding, robustness, and deployment feasibility. We also review widely used datasets and evaluation protocols, and discuss their limitations with respect to modality composition, task heterogeneity, and result comparability. Finally, we outline important open problems and future directions, including the detection of LLM‐generated misinformation, cross‐lingual generalisation, interpretable and evidence‐grounded reasoning, and trustworthy evaluation in realistic deployment settings.

Cite this study

Hu et al. (Sun,) studied this question.

www.synapsesocial.com/papers/69e713decb99343efc98d46d — DOI: https://doi.org/10.1111/exsy.70260

Authors

Tingqi Hu

Ying Lei

Yiduo Wang

Journals

Expert Systems

Institutions

Zhengzhou University

Jiangxi University of Finance and Economics

Zhengzhou University of Science and Technology
