Papers
Topics
Authors
Recent
2000 character limit reached

New Instability Results for High Dimensional Nearest Neighbor Search

Published 3 Jun 2009 in cs.DB and cs.IR | (0906.0684v1)

Abstract: Consider a dataset of n(d) points generated independently from Rd according to a common p.d.f. f_d with support(f_d) = [0,1]d and sup{f_d([0,1]d)} growing sub-exponentially in d. We prove that: (i) if n(d) grows sub-exponentially in d, then, for any query point qd in [0,1]d and any epsilon>0, the ratio of the distance between any two dataset points and qd is less that 1+epsilon with probability -->1 as d-->infinity; (ii) if n(d)>[4(1+epsilon)]d for large d, then for all qd in [0,1]d (except a small subset) and any epsilon>0, the distance ratio is less than 1+epsilon with limiting probability strictly bounded away from one. Moreover, we provide preliminary results along the lines of (i) when f_d=N(mu_d,Sigma_d).

Citations (10)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.