Derive k-dependent performance equations for top-k E2LSH/E2LSHoS
Derive explicit analytical equations for how the in-memory E2LSH query time and the number of hash-bucket reads (I/O operations) in E2LSH-on-Storage (E2LSHoS) depend on the top-k parameter k, for Euclidean c-approximate top-k nearest neighbor search with E2LSH parameters (m, L, S). Such equations would enable precise prediction of storage IOPS requirements for top-k queries.
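Until such equations exist, the dependence on k can only be measured empirically. The following Python sketch illustrates the quantity in question on a toy index; it is not the implementation from the cited paper. It assumes that m is the number of p-stable hash functions per table, that L is the number of tables, that S caps the distance computations per radius, and that the search walks the radii 1, c, c^2, ... until k points within c·R are found. The bucket width `w0`, the synthetic data, and the rule that every table probe counts as one bucket read are likewise illustrative assumptions.

```python
import math
import random

def hash_key(p, funcs, w):
    # Concatenate m p-stable hashes floor((a.p + b) / w) into one bucket key.
    return tuple(math.floor((sum(ai * xi for ai, xi in zip(a, p)) + b) / w)
                 for a, b in funcs)

def build_index(points, m, L, radii, w0, rng):
    # One group of L tables per radius R; the bucket width scales with R (assumption).
    dim = len(points[0])
    index = []
    for R in radii:
        w = w0 * R
        tables = []
        for _ in range(L):
            funcs = [([rng.gauss(0.0, 1.0) for _ in range(dim)],
                      rng.uniform(0.0, w)) for _ in range(m)]
            buckets = {}
            for i, p in enumerate(points):
                buckets.setdefault(hash_key(p, funcs, w), []).append(i)
            tables.append((funcs, buckets))
        index.append((R, w, tables))
    return index

def bucket_reads(index, points, q, k, c, S):
    # Count table probes (treated as bucket reads) until k points lie within c * R.
    reads = 0
    for R, w, tables in index:
        checked, near = 0, set()
        for funcs, buckets in tables:
            if checked >= S:          # candidate budget for this radius is spent
                break
            reads += 1                # every probe counts as one read, even if empty
            for i in buckets.get(hash_key(q, funcs, w), []):
                if checked >= S:
                    break
                checked += 1
                if math.dist(points[i], q) <= c * R:
                    near.add(i)
        if len(near) >= k:            # top-k termination condition
            break
    return reads

rng = random.Random(0)
points = [[rng.uniform(0.0, 10.0) for _ in range(8)] for _ in range(300)]
c, m, L, S = 2.0, 4, 8, 200           # hypothetical parameter values
radii = [c ** j for j in range(6)]
index = build_index(points, m, L, radii, w0=4.0, rng=rng)
queries = [[rng.uniform(0.0, 10.0) for _ in range(8)] for _ in range(20)]
mean_reads = {k: sum(bucket_reads(index, points, q, k, c, S) for q in queries)
                 / len(queries)
              for k in (1, 5, 20)}
print(mean_reads)
```

Under these assumptions the candidates examined at each radius do not depend on k, so the read count for a given query can only stay flat or grow as k increases. An analytical model would need to predict how quickly it grows.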
References
we know they both grow sublinearly in n, and while we do not have equations for k, no substantial change in the IOPS requirements is observed for larger k as shown in Figure 1.
— Implementing and Evaluating E2LSH on Storage
(2403.16404 - Nakanishi et al., 25 Mar 2024) in Section 4.6 (Requirements for In-memory Speeds)