
Proxy Non-Discrimination in Data-Driven Systems (1707.08120v1)

Published 25 Jul 2017 in cs.CY and cs.LG

Abstract: Machine-learnt systems inherit biases against protected classes (historically disparaged groups) from training data. Usually, these biases are not explicit: they rely on subtle correlations discovered by training algorithms and are therefore difficult to detect. We formalize proxy discrimination in data-driven systems, a class of properties indicative of bias, as the presence of protected class correlates that have causal influence on the system's output. We evaluate an implementation on a corpus of social datasets, demonstrating how to validate systems against these properties and to repair violations where they occur.
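The abstract's definition has two parts: a quantity that correlates with the protected class, and causal influence of that quantity on the output. The following is a minimal sketch of checking both for a single input feature. It is not the authors' implementation. The abstract does not name its measures, so the absolute Pearson correlation, the swap-based influence score, the thresholds `eps` and `delta`, the toy `rows`, and the toy `model` are all assumptions made for illustration.

```python
from math import sqrt


def association(xs, ys):
    """Absolute Pearson correlation (an assumed stand-in association measure)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no association
    return abs(cov) / sqrt(vx * vy)


def influence(model, rows, feature):
    """Fraction of (row, replacement) pairs whose output changes when the
    feature is overwritten with a value taken from the data (an intervention)."""
    values = [r[feature] for r in rows]
    changed = 0
    for r in rows:
        base = model(r)
        for v in values:
            changed += model({**r, feature: v}) != base
    return changed / (len(rows) * len(values))


def is_proxy_use(model, rows, feature, protected, eps, delta):
    """Flag the feature if it is both associated with the protected
    attribute (>= eps) and influential on the output (>= delta)."""
    assoc = association([r[feature] for r in rows],
                        [r[protected] for r in rows])
    return assoc >= eps and influence(model, rows, feature) >= delta


# Hypothetical toy data: zip_group partially tracks the protected attribute.
rows = [
    {"protected": 1, "zip_group": 1, "income": 30},
    {"protected": 1, "zip_group": 1, "income": 80},
    {"protected": 1, "zip_group": 0, "income": 50},
    {"protected": 0, "zip_group": 0, "income": 40},
    {"protected": 0, "zip_group": 0, "income": 90},
    {"protected": 0, "zip_group": 1, "income": 60},
]


def model(r):
    """Hypothetical classifier that never reads the protected attribute."""
    return 1 if r["zip_group"] == 0 and r["income"] >= 40 else 0


if __name__ == "__main__":
    print(is_proxy_use(model, rows, "zip_group", "protected", eps=0.3, delta=0.3))
```

The model never reads `protected` directly, yet `zip_group` can still be flagged because it both tracks the protected attribute and changes the decision when intervened on. Raising `eps` or `delta` makes the check stricter.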

Authors (5)
  1. Anupam Datta (51 papers)
  2. Matt Fredrikson (44 papers)
  3. Gihyuk Ko (3 papers)
  4. Piotr Mardziel (18 papers)
  5. Shayak Sen (12 papers)
Citations (49)
