What type of study is this?

September 10, 2025

Advancing Fact Attribution for Query Answering: Aggregate Queries and Novel Algorithms

Key Points

The novel approach achieves runtime improvements up to 3 orders of magnitude for aggregate queries.
Utilizing the Banzhaf and Shapley values, the method calculates contributions efficiently with optimizations.
Experiments with a million instances across 3 databases validate the practical application of these algorithms.
The first optimization reduces redundant calculations by leveraging similar contributions among input tuples.

Abstract

In this paper, we introduce a novel approach to computing the contribution of input tuples to the result of the query, quantified by the Banzhaf and Shapley values. In contrast to prior algorithmic work that focuses on Select-Project-Join-Union queries, ours is the first practical approach for queries with aggregates. It relies on two novel optimizations that are essential for its practicality and significantly improve the runtime performance already for queries without aggregates. The first optimization exploits the observation that many input tuples have the same contribution to the query result, so it is enough to compute the contribution of one of them. The second optimization uses the gradient of the query lineage to compute the contributions of all tuples with the same complexity as for one of them. Experiments with a million instances over 3 databases show that our approach achieves up to 3 orders of magnitude runtime improvements over the state-of-the-art for queries without aggregates, and that it is practical for aggregate queries.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Omer Abramovich

Daniel Deutch

Nave Frost

Journals

Proceedings of the VLDB Endowment

Actions

Institutions

University of Zurich

Tel Aviv University

Regensburg University of Applied Sciences

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Advancing Fact Attribution for Query Answering: Aggregate Queries and Novel Algorithms

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study