What question did this study set out to answer?

The study aims to distinguish failures in retrieval-augmented generation (RAG) by query type: multi-intent queries and constraint-bound queries.

April 12, 2026 | Open Access

Rethinking RAG Failures on Complex Queries: Multi-Intent Coverage and Constraint-Bound Validity Risk

Key Points

Conducted retrieval-only experiments on a GOV.UK corpus
Analyzed structurally differentiated query sets
Assessed performance in terms of chunk size and retrieval accuracy
Multi-Intent Query (MIQ) performance improves substantially as chunk size increases
Constraint-Bound Query (CBQ) retrieval accuracy shows only limited improvement as chunk size increases
CBQ retrieval often favors documents that meet only some of the required conditions, indicating a structural challenge
Failures should be understood in the context of query structure and user-facing risk

Abstract

Failures on complex queries in retrieval-augmented generation (RAG) are not uniform. Depending on query structure, they differ not only in retrieval form but also in user-facing meaning and risk. This study argues that complex queries in RAG should be differentiated into at least two types: Multi-Intent Queries (MIQ), in which multiple independent questions must be covered, and Constraint-Bound Queries (CBQ), in which a single question is governed by multiple jointly required conditions. Using a GOV.UK corpus, we conducted retrieval-only experiments on structurally differentiated query sets. The results show that MIQ is primarily a coverage/segmentation problem: performance improves substantially as chunk size increases, indicating that its main difficulty lies in whether both intended sources can be adequately covered. By contrast, CBQ shows only limited improvement, with top-ranked retrieval continuing to favor topically adjacent documents that satisfy only part of the required conditions. This suggests that CBQ is not merely a problem of retrieval granularity, but a structural problem of condition preservation and validity with stronger user-facing risk. These findings show that complex-query handling in RAG should not be evaluated only in terms of retrieval accuracy. Instead, failures should be understood in relation to query structure, user-facing risk, and the difference between incomplete retrieval and potentially misleading retrieval. This study provides a structural account of complex-query failure and offers an evaluative perspective for designing safer RAG systems.
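The distinction between incomplete retrieval (MIQ) and potentially misleading retrieval (CBQ) can be illustrated with a minimal sketch. This is not the evaluation code used in the study: the `Doc` type, the function names `miq_coverage` and `cbq_top1_status`, and the condition labels are all hypothetical, and the sketch assumes each document has been annotated with the set of conditions it satisfies.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Doc:
    """Hypothetical retrieved document with hand-labelled conditions."""
    doc_id: str
    conditions: frozenset = field(default_factory=frozenset)


def miq_coverage(retrieved_ids, target_ids, k):
    """Fraction of the intended source documents found in the top-k results.

    A value below 1.0 means the answer is incomplete: at least one of the
    independent questions has no supporting source.
    """
    targets = set(target_ids)
    if not targets:
        raise ValueError("target_ids must not be empty")
    return len(set(retrieved_ids[:k]) & targets) / len(targets)


def cbq_top1_status(retrieved, required):
    """Classify the top-ranked document against jointly required conditions.

    "valid"   : every required condition is satisfied
    "partial" : some, but not all, conditions are satisfied (the
                topically adjacent, potentially misleading case)
    "miss"    : no required condition is satisfied, or nothing was retrieved
    """
    required = set(required)
    if not retrieved:
        return "miss"
    satisfied = set(retrieved[0].conditions) & required
    if satisfied == required:
        return "valid"
    return "partial" if satisfied else "miss"


# MIQ: two independent questions, each answered by a different source.
ranking = ["doc_a", "doc_x", "doc_b"]
coverage_at_2 = miq_coverage(ranking, ["doc_a", "doc_b"], k=2)
coverage_at_3 = miq_coverage(ranking, ["doc_a", "doc_b"], k=3)

# CBQ: one question with two jointly required (made-up) conditions.
docs = [
    Doc("doc_1", frozenset({"self_employed"})),
    Doc("doc_2", frozenset({"self_employed", "living_abroad"})),
]
status = cbq_top1_status(docs, {"self_employed", "living_abroad"})
```

In this toy case the MIQ score moves from partial to full coverage once the retrieval window widens, whereas the CBQ top result remains a document that satisfies only one of the two conditions, which is the kind of outcome the abstract describes as carrying stronger user-facing risk.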

Authors

Yuji K. Takahashi
