Abstract
Approximate Nearest Neighbor (ANN) indexing constitutes a fundamental component of modern vector-based databases, facilitating efficient and accurate information retrieval for applications such as retrieval-augmented generation. However, scaling graph-based ANN indices to billion-scale datasets poses substantial challenges, including high memory demands and inefficiencies in handling partitioned graphs. To address these issues, we propose ScaDANN, a scalable disk-based graph indexing method tailored for large-scale datasets under limited memory conditions. ScaDANN introduces two novel techniques: overlapping block-level insertion and grid block merge, which enable the efficient construction of unified graph indices while preserving high search performance. Our approach achieves notable advancements in index construction speed, search accuracy, and memory efficiency, establishing ScaDANN as a robust and effective solution for scalable ANN indexing in resource-limited environments.
| Original language | English |
|---|---|
| Journal | CEUR Workshop Proceedings |
| Volume | 3946 |
| State | Published - 2025 |
| Event | Workshops of the EDBT/ICDT 2025 Joint Conference, EDBT/ICDT-WS 2025 - Barcelona, Spain Duration: 25 Mar 2025 → … |
Keywords
- ANN
- Graph
- Nearest neighbor search
- Vector search
Fingerprint
Dive into the research topics of 'ScaDANN: A Scalable Disk-Based Graph Indexing Method for ANN'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver