What question did this study set out to answer?

The aim is to improve the reliability of long-form evidence integration in language model outputs.

April 13, 2026Open Access

TM-RAG: a transformer-mamba model for long-text evidence aggregation in retrieval-augmented generation

Key Points

The aim is to improve the reliability of long-form evidence integration in language model outputs.
Introduced TM-RAG, combining a Transformer architecture with Mamba.
Implemented a CAGF dynamic feature fusion module for better long-range dependency modeling.
Developed a multi-level contrastive learning objective across sentence, slot, and token levels.
Generation F1 score improved from 0.5376 to 0.551 on Zuo Zongtang dataset.
BLEU score increased from 0.6491 to 0.6634 on the same dataset.
Showed stable performance improvements on HotpotQA, MuSiQue and SQuAD benchmarks.

Abstract

Retrieval-Augmented Generation (RAG) incorporates externally retrieved evidence to support generation and has been widely used to mitigate hallucinations in large language models (LLMs). In real-world settings, long-form evidence makes it difficult to jointly encode global semantics and salient elements, leading retrieval to favor topical similarity over factual consistency. To address this issue, we propose TM-RAG, which couples a Transformer with Mamba and introduces a CAGF dynamic feature fusion module to enhance long-range dependency modeling and global semantic representation. We further design a multi-level contrastive learning objective—sentence-level, slot-level, and token-level masked-recovery contrastive learning—to strengthen global semantic alignment and fine-grained factual modeling. Experiments demonstrate that TM-RAG delivers stable improvements on the Chinese Zuo Zongtang historical dataset as well as HotpotQA, MuSiQue and SQuAD benchmarks; on the Zuo Zongtang dataset, the generation F1 increases from 0.5376 to 0.551 and BLEU from 0.6491 to 0.6634, validating the effectiveness of the proposed method.

Bookmark

View Full Paper

Cite This Study

Hu et al. (Sat,) studied this question.

synapsesocial.com/papers/69dc89183afacbeac03eac89 https://doi.org/https://doi.org/10.1007/s44443-026-00723-5

Bookmark

View Full Paper