What question did this study set out to answer?

The aim is to improve the performance of merging vector indexes in databases, specifically using the HNSW approach.

April 10, 2026Open Access

Efficient Vector Index Merging in Vector Databases

Key Points

The aim is to improve the performance of merging vector indexes in databases, specifically using the HNSW approach.
Developed a new algorithm named HNSW-Merger
Implemented a two-stage, search-based merging process
Utilized forward HNSW search and lazy backward direct-connect techniques
Optimized for multi-core parallel processing
Supported merging of multiple indexes efficiently
HNSW-Merger significantly outperformed existing index merging methods
Maintained or enhanced index quality during merging
Demonstrated faster merging times in extensive experiments

Abstract

Vector databases have become a cornerstone of modern data science and AI applications, powering recommendation systems, semantic search, retrieval-augmented generation, and more. This paper focuses on vector index merging (particularly HNSW merging), which merges two (or more) vector indexes. This is a key operation in vector databases with many use cases in vector index construction and vector index updates. While there are a few early approaches to solve the problem, the index merging performance remains slow. In this work, we propose HNSW-Merger, a new algorithm for merging two (or more) HNSW indexes that fully exploits the proximity information in existing indexes. It is a novel two-stage, search-based algorithm that relies on forward HNSW search and lazy backward direct-connect to efficiently connect potential edges. HNSW-Merger is optimized for multi-core parallelism and memory efficiency. It also supports efficient merging of multiple indexes. Extensive experiments show that HNSW-Merger achieves significantly faster merging performance than prior approaches while maintaining similar or even higher index quality.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Chenzhe Jin

Yunan Zhang

Jiayi Liu

Journals

Proceedings of the ACM on Management of Data

Actions

Institutions

Purdue University West Lafayette

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Efficient Vector Index Merging in Vector Databases

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study