Improving the Fairness of Chest X-ray Classifiers (2203.12609v1)

Published 23 Mar 2022 in cs.LG, cs.CV, cs.CY, and eess.IV

Abstract: Deep learning models have reached or surpassed human-level performance in the field of medical imaging, especially in disease diagnosis using chest x-rays. However, prior work has found that such classifiers can exhibit biases in the form of gaps in predictive performance across protected groups. In this paper, we question whether striving to achieve zero disparities in predictive performance (i.e. group fairness) is the appropriate fairness definition in the clinical setting, over minimax fairness, which focuses on maximizing the performance of the worst-case group. We benchmark the performance of nine methods in improving classifier fairness across these two definitions. We find, consistent with prior work on non-clinical data, that methods which strive to achieve better worst-group performance do not outperform simple data balancing. We also find that methods which achieve group fairness do so by worsening performance for all groups. In light of these results, we discuss the utility of fairness definitions in the clinical setting, advocating for an investigation of the bias-inducing mechanisms in the underlying data generating process whenever possible.
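To make the two fairness definitions in the abstract concrete, the sketch below computes both from per-group performance: the gap between the best and worst group (which group fairness tries to drive to zero) and the worst-group score (which minimax fairness tries to maximize). It is an illustration only, not the paper's evaluation code. Accuracy stands in for "predictive performance", the function names are hypothetical, and the toy labels and predictions are invented for the example.

```python
from collections import defaultdict


def per_group_accuracy(y_true, y_pred, groups):
    """Return {group: accuracy}. Accuracy is an assumed stand-in metric."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p, g in zip(y_true, y_pred, groups):
        total[g] += 1
        correct[g] += int(t == p)
    return {g: correct[g] / total[g] for g in total}


def fairness_summary(y_true, y_pred, groups):
    """Summarize a classifier under both definitions.

    gap         -> group fairness view (smaller is "fairer")
    worst_group -> minimax fairness view (larger is "fairer")
    """
    acc = per_group_accuracy(y_true, y_pred, groups)
    return {
        "per_group": acc,
        "gap": max(acc.values()) - min(acc.values()),
        "worst_group": min(acc.values()),
    }


# Toy data (invented): 10 positive cases in each of two groups, "a" and "b".
y_true = [1] * 20
groups = ["a"] * 10 + ["b"] * 10

# Hypothetical model A: 9/10 correct on group a, 7/10 correct on group b.
pred_a = [1] * 9 + [0] * 1 + [1] * 7 + [0] * 3
# Hypothetical model B: 6/10 correct on both groups.
pred_b = [1] * 6 + [0] * 4 + [1] * 6 + [0] * 4

summary_a = fairness_summary(y_true, pred_a, groups)
summary_b = fairness_summary(y_true, pred_b, groups)

print("model A:", summary_a)
print("model B:", summary_b)
```

In this toy case, model B has no gap between groups, so it looks better under group fairness, yet it is less accurate than model A for both groups, so model A is preferred under minimax fairness. This is the kind of trade-off the abstract describes when it notes that methods achieving group fairness do so by worsening performance for all groups.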

Authors (6)
  1. Haoran Zhang (102 papers)
  2. Natalie Dullerud (10 papers)
  3. Karsten Roth (36 papers)
  4. Lauren Oakden-Rayner (1 paper)
  5. Stephen Robert Pfohl (1 paper)
  6. Marzyeh Ghassemi (96 papers)
Citations (56)
