Logic bugs in distributed database management systems silently yield incorrect results and are difficult to detect, yet they threaten correctness in critical deployments. Existing testing techniques focus mainly on system level failures or general DBMS and do not target distributed execution, leaving many distributed logic bugs undiscovered. We present DistSQL, a differential testing framework that compares the results of identical SQL queries executed on the same DBMS configured in centralized and distributed modes. The key insight is that centralized execution is simpler and typically better tested, and thus can serve as a practical reference for distributed execution. DistSQL addresses two difficulties that hinder effective bug finding in distributed settings: it performs distributed diversity oriented database state mutation to expose distribution specific behaviors, and it conducts distributed interaction guided exploration of the execution space using query plan features to prioritize novel behaviors over redundant tests. DistSQL requires no intrusive code instrumentation. The evaluation on five popular open source distributed DBMSs, including TiDB, CockroachDB, YugabyteDB, ClickHouse, and OceanBase, as well as one widely deployed commercial DBMS, demonstrates the efficacy of DistSQL. DistSQL identified 65 previously unknown logic bugs, including 38 specific to distributed execution. Of these, 61 have been confirmed and 50 have been fixed.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zi-Xuan Fu
Jia-Ju Bai
Hong-Bo Feng
Proceedings of the ACM on Management of Data
Tsinghua University
Beihang University
Building similarity graph...
Analyzing shared references across papers
Loading...
Fu et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69d894ce6c1944d70ce05bd9 — DOI: https://doi.org/10.1145/3786673
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: