New Instability Results for High Dimensional Nearest Neighbor Search (0906.0684v1)

Published 3 Jun 2009 in cs.DB and cs.IR

Abstract: Consider a dataset of n(d) points generated independently from R^d according to a common p.d.f. f_d with support(f_d) = [0,1]^d and sup{f_d([0,1]^d)} growing sub-exponentially in d. We prove that: (i) if n(d) grows sub-exponentially in d, then, for any query point q_d in [0,1]^d and any epsilon > 0, the ratio of the distances from any two dataset points to q_d is less than 1+epsilon with probability --> 1 as d --> infinity; (ii) if n(d) > [4(1+epsilon)]^d for large d, then for all q_d in [0,1]^d (except a small subset) and any epsilon > 0, the distance ratio is less than 1+epsilon with limiting probability strictly bounded away from one. Moreover, we provide preliminary results along the lines of (i) when f_d = N(mu_d, Sigma_d).
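
The concentration effect in claim (i) can be seen in a small simulation. The sketch below is illustrative only and is not taken from the paper: it assumes the uniform density on [0,1]^d (one density that satisfies the stated support and boundedness conditions), a fixed dataset size of 200 points (which trivially grows sub-exponentially in d), and Euclidean distance. The function name `distance_ratio` and all parameter values are hypothetical choices made for this example. It reports the ratio of the farthest to the nearest dataset point from a random query, which is the worst case over all pairs of dataset points.

```python
import math
import random


def distance_ratio(n, d, rng):
    """Ratio of farthest to nearest Euclidean distance from a random
    query point to n points drawn uniformly from [0,1]^d.

    Assumption for illustration: uniform density, Euclidean metric.
    """
    query = [rng.random() for _ in range(d)]
    dists = []
    for _ in range(n):
        point = [rng.random() for _ in range(d)]
        dists.append(math.dist(point, query))
    return max(dists) / min(dists)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    for d in (2, 20, 200, 2000):
        ratio = distance_ratio(200, d, rng)
        print(f"d={d:5d}  max/min distance ratio = {ratio:.3f}")
```

Under these assumptions the printed ratio should shrink toward 1 as d grows, meaning the nearest and farthest neighbors become hard to tell apart. The sketch does not probe claim (ii), since a dataset of more than [4(1+epsilon)]^d points is infeasible to simulate for any sizeable d.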

Citations (10)
