Skip to main navigation Skip to search Skip to main content

ScaDANN: A Scalable Disk-Based Graph Indexing Method for ANN

  • Korea Advanced Institute of Science and Technology

Research output: Contribution to journalConference articlepeer-review

Abstract

Approximate Nearest Neighbor (ANN) indexing constitutes a fundamental component of modern vector-based databases, facilitating efficient and accurate information retrieval for applications such as retrieval-augmented generation. However, scaling graph-based ANN indices to billion-scale datasets poses substantial challenges, including high memory demands and inefficiencies in handling partitioned graphs. To address these issues, we propose ScaDANN, a scalable disk-based graph indexing method tailored for large-scale datasets under limited memory conditions. ScaDANN introduces two novel techniques: overlapping block-level insertion and grid block merge, which enable the efficient construction of unified graph indices while preserving high search performance. Our approach achieves notable advancements in index construction speed, search accuracy, and memory efficiency, establishing ScaDANN as a robust and effective solution for scalable ANN indexing in resource-limited environments.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume3946
StatePublished - 2025
EventWorkshops of the EDBT/ICDT 2025 Joint Conference, EDBT/ICDT-WS 2025 - Barcelona, Spain
Duration: 25 Mar 2025 → …

Keywords

  • ANN
  • Graph
  • Nearest neighbor search
  • Vector search

Fingerprint

Dive into the research topics of 'ScaDANN: A Scalable Disk-Based Graph Indexing Method for ANN'. Together they form a unique fingerprint.

Cite this