Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Causal Feature Selection for Algorithmic Fairness (2006.06053v2)

Published 10 Jun 2020 in cs.LG, cs.CY, cs.DB, and stat.ML

Abstract: The use of ML in high-stakes societal decisions has encouraged the consideration of fairness throughout the ML lifecycle. Although data integration is one of the primary steps to generate high quality training data, most of the fairness literature ignores this stage. In this work, we consider fairness in the integration component of data management, aiming to identify features that improve prediction without adding any bias to the dataset. We work under the causal interventional fairness paradigm. Without requiring the underlying structural causal model a priori, we propose an approach to identify a sub-collection of features that ensure the fairness of the dataset by performing conditional independence tests between different subsets of features. We use group testing to improve the complexity of the approach. We theoretically prove the correctness of the proposed algorithm to identify features that ensure interventional fairness and show that sub-linear conditional independence tests are sufficient to identify these variables. A detailed empirical evaluation is performed on real-world datasets to demonstrate the efficacy and efficiency of our technique.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Sainyam Galhotra (28 papers)
  2. Karthikeyan Shanmugam (85 papers)
  3. Prasanna Sattigeri (70 papers)
  4. Kush R. Varshney (121 papers)
Citations (36)

Summary

We haven't generated a summary for this paper yet.