Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Priv'IT: Private and Sample Efficient Identity Testing (1703.10127v3)

Published 29 Mar 2017 in cs.DS, cs.CR, cs.IT, cs.LG, math.IT, math.ST, and stat.TH

Abstract: We develop differentially private hypothesis testing methods for the small sample regime. Given a sample $\cal D$ from a categorical distribution $p$ over some domain $\Sigma$, an explicitly described distribution $q$ over $\Sigma$, some privacy parameter $\varepsilon$, accuracy parameter $\alpha$, and requirements $\beta_{\rm I}$ and $\beta_{\rm II}$ for the type I and type II errors of our test, the goal is to distinguish between $p=q$ and $d_{\rm{TV}}(p,q) \geq \alpha$. We provide theoretical bounds for the sample size $|{\cal D}|$ so that our method both satisfies $(\varepsilon,0)$-differential privacy, and guarantees $\beta_{\rm I}$ and $\beta_{\rm II}$ type I and type II errors. We show that differential privacy may come for free in some regimes of parameters, and we always beat the sample complexity resulting from running the $\chi2$-test with noisy counts, or standard approaches such as repetition for endowing non-private $\chi2$-style statistics with differential privacy guarantees. We experimentally compare the sample complexity of our method to that of recently proposed methods for private hypothesis testing.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Bryan Cai (4 papers)
  2. Constantinos Daskalakis (111 papers)
  3. Gautam Kamath (68 papers)
Citations (49)

Summary

We haven't generated a summary for this paper yet.