Aggregate queries often require computing large intermediate joins despite producing only small outputs. We identify broad classes of acyclic aggregate queries that can be evaluated without materialising any join results, using a bottom-up, semi-join–based propagation of cardinalities and partial aggregates. An implementation in Spark SQL shows that this approach is widely applicable and yields substantial performance gains on standard benchmarks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Lanzinger et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69be37726e48c4981c677274 — DOI: https://doi.org/10.4230/lipics.icdt.2026.24
Matthias Lanzinger
Reinhard Pichler
Alexander Selzer
TU Wien
Building similarity graph...
Analyzing shared references across papers
Loading...