What question did this study set out to answer?

The aim is to improve similarity search efficiency in metric spaces by addressing the limitations of existing pivot-based methods.

April 10, 2026Open Access

LM-Tree: A Hybrid Learned Index for Similarity Search in Metric Spaces

Key Points

The aim is to improve similarity search efficiency in metric spaces by addressing the limitations of existing pivot-based methods.
Proposed LM-Tree which combines pivots and learning models with M-Tree.
Developed a self-adaptive node architecture for dynamic pivot selection.
Created a maintenance algorithm for handling dynamic updates like pivot adjustments and node merges.
LM-Tree significantly enhances pruning performance compared to traditional pivot-based methods.
Demonstrated efficiency in maintaining a large number of child nodes with low computational overhead.
Outperformed state-of-the-art methods in extensive experiments on various datasets.

Abstract

Similarity search in metric spaces is a fundamental problem in data management with many applications. While numerous indices have been proposed to support similarity search, most existing approaches rely on pivot-based strategies that suffer from critical limitations. Traditional single-pivot methods offer limited pruning power, while multi-pivot techniques often become inefficient as data evolves, since pivot updates incur substantial computational overhead. In this paper, we propose LM-Tree (short for Learned M-Tree), a hybrid learned index that combines pivots and learning models with M-Tree to address the above problems. LM-Tree's key innovation lies in its self-adaptive node architecture, where each node dynamically selects an appropriate number of pivots and incorporates lightweight learning models to enhance pruning efficiency. This design enables each node to maintain a relatively large number of child nodes (or objects for leaf nodes) while consistently delivering strong pruning performance, thereby enabling LM-Tree to use a small number of nodes to index objects in metric spaces. Furthermore, we develop an efficient maintenance algorithm that handles dynamic updates, including pivot adjustments, model reforms, node splits, and merges with low overhead. Extensive experiments on both real and synthetic datasets demonstrate that LM-Tree significantly outperforms state-of-the-art methods.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Wang et al. (Thu,) studied this question.

www.synapsesocial.com/papers/69d894526c1944d70ce0536e — DOI: https://doi.org/10.1145/3786665

Authors

Yaqi Wang

Bin Wang

Rui Zhu

Journals

Proceedings of the ACM on Management of Data

Actions

Institutions

Northeastern University

Shenyang Aerospace University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

LM-Tree: A Hybrid Learned Index for Similarity Search in Metric Spaces

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion