Dice Question Streamline Icon: https://streamlinehq.com

Improve the dependence on k in bounded-query-size algorithms

Improve the dependence on the number of clusters k in the query complexity of the non-adaptive k-clustering algorithm with bounded query size s ≤ √n, which currently uses \widetilde{O}(n^2 k/s^2) subset queries, aiming to reduce or eliminate the linear k factor while maintaining near-optimal dependence on s.

Information Square Streamline Icon: https://streamlinehq.com

Background

For bounded query size s, the paper presents a non-adaptive algorithm with query complexity \widetilde{O}(n2 k/s2) (optimal up to logarithmic factors for constant k) and another with \widetilde{O}(n2/s), each trading off dependence on k and s. The lower bound is Ω(max(n2/s2, n)) even for k=3.

The authors explicitly identify improving the dependence on k in their \widetilde{O}(n2 k/s2) algorithm as an open problem, highlighting a gap between current upper bounds and the lower bound when k is not constant.

References

In particular, for constant $k$, the above algorithm matches the lower bound of \Cref{thm:LB-NA} up to $\log n$ factors. We leave the question of improving the dependence on $k$ as an interesting open problem.

Clustering with Non-adaptive Subset Queries (2409.10908 - Black et al., 17 Sep 2024) in Section 1.1 (Results), Bounding query size paragraph