What question did this study set out to answer?

This research aims to improve the performance and stability of learned query optimizers when faced with diverse workloads.

April 10, 2026Open Access

Divo: Learning a Stable and Effective Query Optimizer with a Diverse Workload

Key Points

This research aims to improve the performance and stability of learned query optimizers when faced with diverse workloads.
Developed a template-evolving query generator to create 3,000 diverse queries.
Established a two-phase model training pipeline using over 100,000 plans from public workloads and generated queries.
Introduced a diversity-aware loss function to manage varied input queries and inconsistencies.
Achieved a 1.3× speedup in total execution latency compared to PostgreSQL.
Demonstrated a performance improvement of 14.9× relative to six existing learned query optimizers on average.

Abstract

Learned query optimizers (LQOs) have shown remarkable progress in recent years. Despite achieving competitive performance compared with traditional methods, current LQOs struggle to retain stability and generate efficient plans when exposed to workloads with substantial diversity, presenting a major challenge in real-world deployment. Divo is a learned query optimizer designed to overcome performance degradation observed in training on diverse workloads. First, Divo proposes a template-evolving query generator to provide diverse queries for sufficient training. By modifying and combining query templates, the query generator synthesizes 3,000 informative new queries with fidelity to existing workloads. Second, Divo establishes a two-phase model training pipeline to enhance RL training with numerous plans collected in advance. We collected over 100,000 plans from public workloads and diverse generated queries to form static experiences, which are effectively leveraged on various training workload configurations to enhance Divo's plan generation through augmenting auxiliary components. Third, Divo proposes a diversity-aware loss function to learn from diversified input queries stably. By learning to predict probability distributions of latencies, Divo flexibly handles various queries and tolerates inconsistent ground truth latencies from training queries. We evaluate Divo's performance on PostgreSQL using a complicated mixed workload of JOB, DSB, Extended JOB, Stack, and generated queries, with three different training and testing workload configurations split by template. Experiments demonstrate Divo's advantage on total execution latency with a 1.3× speedup relative to PostgreSQL, which is 14.9× better than six existing LQOs on average.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Chen et al. (Thu,) studied this question.

www.synapsesocial.com/papers/69d895046c1944d70ce05fbb — DOI: https://doi.org/10.1145/3786641

Authors

Tianyi Chen

Jun Gao

Yaofeng Tu

Journals

Proceedings of the ACM on Management of Data

Actions

Institutions

Peking University

ZTE (China)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Divo: Learning a Stable and Effective Query Optimizer with a Diverse Workload

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion