Close

Presentation

This content is available for: Workshop Reg Pass. Upgrade Registration
Towards a Massive-Scale Distributed Neighborhood Graph Construction
DescriptionGraph-based approximate nearest neighbor algorithms have shown high performance and quality. However, such approaches require a large amount of memory and still take a long time to construct high-quality nearest neighbor graphs (NNGs). Using distributed memory systems is important when data is large or a shorter indexing time is desired.

We develop a distributed memory version of NN-Descent, a widely known graph-based ANN algorithm, closely following algorithmic advances by PyNN-Descent authors. Our distributed NN-Descent (DNND) is built on top of MPI and leverages two existing high-performance computing libraries: YGM (an asynchronous communication library) and Metall (a persistent memory allocator).

We evaluate the performance of DNND on an HPC system using billion-scale datasets, demonstrating that our approach shows high performance and strong scaling and has great potential for developing massive-scale NNG frameworks.
Event Type
Workshop
TimeSunday, 12 November 20233:30pm - 4pm MST
Location702
Tags
Algorithms
Applications
Architecture and Networks
Registration Categories
W