Priv'IT: Private and Sample Efficient Identity Testing (1703.10127v3)

Published 29 Mar 2017 in cs.DS, cs.CR, cs.IT, cs.LG, math.IT, math.ST, and stat.TH

Abstract: We develop differentially private hypothesis testing methods for the small sample regime. Given a sample $\cal D$ from a categorical distribution $p$ over some domain $\Sigma$, an explicitly described distribution $q$ over $\Sigma$, some privacy parameter $\varepsilon$, accuracy parameter $\alpha$, and requirements $\beta_{\rm I}$ and $\beta_{\rm II}$ for the type I and type II errors of our test, the goal is to distinguish between $p=q$ and $d_{\rm{TV}}(p,q) \geq \alpha$. We provide theoretical bounds for the sample size $|{\cal D}|$ so that our method both satisfies $(\varepsilon,0)$-differential privacy, and guarantees $\beta_{\rm I}$ and $\beta_{\rm II}$ type I and type II errors. We show that differential privacy may come for free in some regimes of parameters, and we always beat the sample complexity resulting from running the $\chi^2$-test with noisy counts, or standard approaches such as repetition for endowing non-private $\chi^2$-style statistics with differential privacy guarantees. We experimentally compare the sample complexity of our method to that of recently proposed methods for private hypothesis testing.

Authors (3)

Bryan Cai (4 papers)
Constantinos Daskalakis (111 papers)
Gautam Kamath (68 papers)

Citations (49)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Priv'IT: Private and Sample Efficient Identity Testing (1703.10127v3)

Summary

Related Papers