Balancing Fairness and Accuracy in Data-Restricted Binary Classification (2403.07724v1)
Abstract: Applications that deal with sensitive information may have restrictions placed on the data available to a machine learning (ML) classifier. For example, in some applications, a classifier may not have direct access to sensitive attributes, affecting its ability to produce accurate and fair decisions. This paper proposes a framework that models the trade-off between accuracy and fairness under four practical scenarios that dictate the type of data available for analysis. Prior works examine this trade-off by analyzing the outputs of a scoring function that has been trained to implicitly learn the underlying distribution of the feature vector, class label, and sensitive attribute of a dataset. In contrast, our framework directly analyzes the behavior of the optimal Bayesian classifier on this underlying distribution by constructing a discrete approximation of it from the dataset itself. This approach enables us to formulate multiple convex optimization problems, which allow us to answer the question: How is the accuracy of a Bayesian classifier affected in different data-restricting scenarios when it is constrained to be fair? Analysis is performed on a set of fairness definitions that includes both group and individual fairness. Experiments on three datasets demonstrate the utility of the proposed framework as a tool for quantifying the trade-offs among different fairness notions and their distributional dependencies.
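To make the abstract's core idea concrete: once the joint distribution of (feature cell, sensitive attribute, label) is approximated by a discrete table, choosing the accuracy-optimal randomized classifier subject to a group-fairness constraint becomes a linear (hence convex) program. The sketch below is illustrative only and is not the paper's exact formulation: it assumes a hypothetical distribution `P`, uses demographic parity with tolerance `eps` as the fairness notion, and solves the program with `scipy.optimize.linprog`.

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
# Hypothetical discrete joint distribution P[x, a, y] over
# 4 feature cells, 2 groups, 2 labels (entries sum to 1).
P = rng.random((4, 2, 2))
P /= P.sum()

n_cells, n_groups = P.shape[0], P.shape[1]
n_vars = n_cells * n_groups  # one acceptance probability t(x, a) per cell/group

# Accuracy = sum_{x,a} [ t(x,a) P(x,a,y=1) + (1 - t(x,a)) P(x,a,y=0) ].
# linprog minimizes, so negate the coefficient of t; the constant
# sum of P(x,a,y=0) is added back at the end.
c = -(P[:, :, 1] - P[:, :, 0]).ravel()

# Demographic parity: | P(Yhat=1 | a=0) - P(Yhat=1 | a=1) | <= eps,
# encoded as two linear inequalities on t.
eps = 0.05
Pa = P.sum(axis=2)            # P(x, a)
cond = Pa / Pa.sum(axis=0)    # P(x | a)
row_mat = np.zeros((n_cells, n_groups))
row_mat[:, 0] = cond[:, 0]
row_mat[:, 1] = -cond[:, 1]
row = row_mat.ravel()
A_ub = np.vstack([row, -row])
b_ub = np.array([eps, eps])

res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=[(0.0, 1.0)] * n_vars)
t = res.x.reshape(n_cells, n_groups)          # fair randomized classifier
fair_acc = P[:, :, 0].sum() - res.fun          # accuracy under the constraint
```

Comparing `fair_acc` against the unconstrained Bayes accuracy, `P[:, :, 0].sum() + np.clip(P[:, :, 1] - P[:, :, 0], 0, None).sum()`, quantifies the cost of fairness on this distribution; sweeping `eps` traces out the trade-off curve.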
- Zachary McBride Lazri
- Danial Dervovic
- Antigoni Polychroniadou
- Ivan Brugere
- Dana Dachman-Soled
- Min Wu