Faster Algorithms for Testing under Conditional Sampling (1504.04103v1)

Published 16 Apr 2015 in cs.DS, cs.CC, cs.LG, math.ST, and stat.TH

Abstract: There has been considerable recent interest in distribution-tests whose run-time and sample requirements are sublinear in the domain-size $k$. We study two of the most important tests under the conditional-sampling model where each query specifies a subset $S$ of the domain, and the response is a sample drawn from $S$ according to the underlying distribution. For identity testing, which asks whether the underlying distribution equals a specific given distribution or $\epsilon$-differs from it, we reduce the known time and sample complexities from $\tilde{\mathcal{O}}(\epsilon^{-4})$ to $\tilde{\mathcal{O}}(\epsilon^{-2})$, thereby matching the information theoretic lower bound. For closeness testing, which asks whether two distributions underlying observed data sets are equal or different, we reduce existing complexity from $\tilde{\mathcal{O}}(\epsilon^{-4} \log⁵ k)$ to an even sub-logarithmic $\tilde{\mathcal{O}}(\epsilon^{-5} \log \log k)$ thus providing a better bound to an open problem in Bertinoro Workshop on Sublinear Algorithms [Fisher, 2004].

Authors (5)

Moein Falahatgar (4 papers)
Ashkan Jafarpour (5 papers)
Alon Orlitsky (25 papers)
Venkatadheeraj Pichapathi (1 paper)
Ananda Theertha Suresh (73 papers)

Citations (29)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Faster Algorithms for Testing under Conditional Sampling (1504.04103v1)

Summary

Related Papers