Achieving high-performance filtered kNN on large-scale datasets with large query batches
Develop a filtered k-nearest neighbors (kNN) search method that attains high performance on GPUs for large-scale datasets when processing large query batches, overcoming the inefficiencies of existing NVIDIA cuVS brute-force approaches that use bitmap post-processing over all points or CSR-based sparse masked matrix multiplication that incurs scattered memory access patterns.
Sponsor
References
As a result, it is unclear how to achieve high performance filtered kNN on large-scale datasets for large batches.
— VecFlow: A High-Performance Vector Data Management System for Filtered-Search on GPUs
(2506.00812 - Xi et al., 1 Jun 2025) in Section 4.2, subsubsection "Bottom-Level GPU-Friendly Brute Force Search for LS"