Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Correcting Underrepresentation and Intersectional Bias for Classification (2306.11112v4)

Published 19 Jun 2023 in cs.LG, cs.CY, cs.DS, and stat.ML

Abstract: We consider the problem of learning from data corrupted by underrepresentation bias, where positive examples are filtered from the data at different, unknown rates for a fixed number of sensitive groups. We show that with a small amount of unbiased data, we can efficiently estimate the group-wise drop-out rates, even in settings where intersectional group membership makes learning each intersectional rate computationally infeasible. Using these estimates, we construct a reweighting scheme that allows us to approximate the loss of any hypothesis on the true distribution, even if we only observe the empirical error on a biased sample. From this, we present an algorithm encapsulating this learning and reweighting process along with a thorough empirical investigation. Finally, we define a bespoke notion of PAC learnability for the underrepresentation and intersectional bias setting and show that our algorithm permits efficient learning for model classes of finite VC dimension.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. Michelle Alexander. The new jim crow. Ohio St. J. Crim. L., 9:7, 2011.
  2. Learning from noisy examples. Machine Learning, 2(4):343–370, 1988.
  3. Intersectionality in quantitative research: A systematic review of its emergence and applications of theory and methods. SSM-population health, 14:100798, 2021.
  4. Are emily and greg more employable than lakisha and jamal? a field experiment on labor market discrimination. American economic review, 94(4):991–1013, 2004.
  5. Alexander Bird. Philosophy of science, volume 5. McGill-Queen’s Press-MQUP, 1998.
  6. Recovering from biased data: Can fairness constraints improve accuracy? arXiv preprint arXiv:1912.01094, 2019.
  7. Causally interpreting intersectionality theory. Philosophy of Science, 83(1):60–81, 2016.
  8. Sample-efficient strategies for learning in the presence of noise. Journal of the ACM (JACM), 46(5):684–719, 1999.
  9. Alexandra Chouldechova. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big data, 5(2):153–163, 2017.
  10. The frontiers of fairness in machine learning. arXiv preprint arXiv:1810.08810, 2018.
  11. Patricia Hill Collins. Black feminist thought: Knowledge, consciousness, and the politics of empowerment. routledge, 2022.
  12. Kimberlé Crenshaw. Demarginalizing the intersection of race and sex: A black feminist critique of antidiscrimination doctrine, feminist theory and antiracist politics. u. Chi. Legal f., page 139, 1989.
  13. Tommy J Curry. The man-not: Race, class, genre, and the dilemmas of black manhood. Temple University Press, 2017.
  14. Tommy J Curry. Killing boogeymen: Phallicism and the misandric mischaracterizations of black males in theory. Res Philosophica, 2018.
  15. Tommy J Curry. II—Must There Be an Empirical Basis for the Theorization of Racialized Subjects in Race-Gender Theory? Proceedings of the Aristotelian Society, 121(1):21–44, 01 2021. ISSN 0066-7374. doi: 10.1093/arisoc/aoaa021. URL https://doi.org/10.1093/arisoc/aoaa021.
  16. Distribution-independent pac learning of halfspaces with massart noise. Advances in Neural Information Processing Systems, 32, 2019.
  17. Fairness through awareness. In Proceedings of the 3rd innovations in theoretical computer science conference, pages 214–226, 2012.
  18. Equality of opportunity in supervised learning. Advances in neural information processing systems, 29, 2016.
  19. Classifying without discriminating. In 2009 2nd international conference on computer, control and communication, pages 1–6. IEEE, 2009.
  20. Selection problems in the presence of implicit bias. arXiv preprint arXiv:1801.03533, 2018.
  21. Inherent trade-offs in the fair determination of risk scores. arXiv preprint arXiv:1609.05807, 2016.
  22. New trends in gender and mathematics performance: a meta-analysis. Psychological bulletin, 136(6):1123, 2010.
  23. Judea Pearl. Causality. Cambridge university press, 2009.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com