
When Fairness Meets Privacy: Fair Classification with Semi-Private Sensitive Attributes (2207.08336v2)

Published 18 Jul 2022 in cs.LG, cs.AI, cs.CR, and cs.CY

Abstract: Machine learning models have demonstrated promising performance in many areas. However, concerns that they can be biased against specific demographic groups hinder their adoption in high-stakes applications. Thus, it is essential to ensure fairness in machine learning models. Most previous efforts require direct access to sensitive attributes for mitigating bias. Nonetheless, it is often infeasible to obtain users' sensitive attributes at large scale, given users' concerns about privacy in the data collection process. Privacy mechanisms such as local differential privacy (LDP) are widely enforced on sensitive information in the data collection stage due to legal compliance and people's increasing awareness of privacy. Therefore, a critical problem is how to make fair predictions under privacy. We study a novel and practical problem of fair classification in a semi-private setting, where most of the sensitive attributes are private and only a small amount of clean ones are available. To this end, we propose a novel framework FairSP that can achieve Fair prediction under the Semi-Private setting. First, FairSP learns to correct the noise-protected sensitive attributes by exploiting the limited clean sensitive attributes. Then, it jointly models the corrected and clean data in an adversarial way for debiasing and prediction. Theoretical analysis shows that the proposed model can ensure fairness under mild assumptions in the semi-private setting. Extensive experimental results on real-world datasets demonstrate the effectiveness of our method for making fair predictions under privacy and maintaining high accuracy.
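
To make the "noise-protected sensitive attributes" in the abstract concrete, the sketch below shows binary randomized response, a standard textbook LDP mechanism. The abstract does not say which mechanism the paper uses, so this choice is an assumption, and the function names (`randomized_response`, `debiased_rate`) are hypothetical. It is not the FairSP correction step, only an illustration of how a privatized binary attribute could be produced and how the population rate can still be estimated from the noisy reports.

```python
import math
import random


def randomized_response(bit, epsilon, rng):
    """Privatize one binary attribute (assumed mechanism, not from the paper).

    The true bit is kept with probability e^eps / (1 + e^eps) and flipped
    otherwise, which satisfies epsilon-local differential privacy.
    """
    p_keep = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    return bit if rng.random() < p_keep else 1 - bit


def debiased_rate(noisy_bits, epsilon):
    """Unbiased estimate of the true fraction of 1s from noisy reports.

    If pi is the true rate and p the keep probability, then
    E[observed] = pi * (2p - 1) + (1 - p); this inverts that relation.
    """
    p = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    observed = sum(noisy_bits) / len(noisy_bits)
    return (observed - (1.0 - p)) / (2.0 * p - 1.0)


if __name__ == "__main__":
    rng = random.Random(0)
    epsilon = 1.0
    # Synthetic population: about 30% of users have attribute value 1.
    true_bits = [1 if rng.random() < 0.3 else 0 for _ in range(20000)]
    noisy_bits = [randomized_response(b, epsilon, rng) for b in true_bits]
    print("true rate:     ", sum(true_bits) / len(true_bits))
    print("noisy rate:    ", sum(noisy_bits) / len(noisy_bits))
    print("debiased rate: ", debiased_rate(noisy_bits, epsilon))
```

Individual noisy values remain unreliable even though the aggregate rate is recoverable, which is one way to see why a small set of clean sensitive attributes, as in the semi-private setting described above, could help correct per-user labels.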

Authors (8)
  1. Canyu Chen (26 papers)
  2. Yueqing Liang (14 papers)
  3. Xiongxiao Xu (10 papers)
  4. Shangyu Xie (11 papers)
  5. Ashish Kundu (36 papers)
  6. Ali Payani (48 papers)
  7. Yuan Hong (46 papers)
  8. Kai Shu (88 papers)
Citations (4)
