An Expert Overview of "Flexibly Fair Representation Learning by Disentanglement"
The paper "Flexibly Fair Representation Learning by Disentanglement" tackles the pervasive issue of fairness in machine learning by advancing methodologies for fair representation learning. This research builds upon the foundations of disentangled representation learning to offer a strategy that accommodates multiple sensitive attributes and their interactions, focusing on subgroup fairness.
Problem and Motivation
The core issue addressed in this paper is the bias machine learning models can exhibit against specific demographic groups when trained on data from domains such as law, finance, or healthcare. Traditional fair representation learning methods typically require the sensitive attributes to be fixed at training time, which limits their adaptability to new tasks where the relevant sensitive attributes may differ. The challenge is to learn representations that can be adjusted after training to satisfy a variety of fairness constraints across tasks with different labels and sensitive attributes.
Proposed Methodology: The Flexibly Fair VAE (FFVAE)
The authors propose the Flexibly Fair VAE (FFVAE), an enhancement of standard VAE approaches tailored to separate sensitive from non-sensitive information. They build on disentangled representation learning, in which individual dimensions of the learned latent representation should align with single, independent semantic factors of the input data. In FFVAE, sensitive information is confined to distinct latent dimensions (one per sensitive attribute), so it can be removed or altered after training, which in turn supports demographic parity across multiple fairness contexts, as sketched below.
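A minimal sketch of that post-training flexibility, assuming a hypothetical interface `encoder(x) -> (z, b)` where `z` holds the non-sensitive latents and `b` holds one dimension per sensitive attribute (the names and shapes are illustrative, not the paper's code):

```python
import torch

def fair_code(encoder, x, drop_attrs):
    """Encode x, then null out the sensitive dimensions listed in drop_attrs.

    Because each sensitive attribute is confined to its own latent dimension,
    zeroing those dimensions removes (most of) the corresponding sensitive
    information before the code is handed to a downstream predictor.
    """
    z, b = encoder(x)                     # z: non-sensitive, b: [batch, n_attrs]
    b = b.clone()
    b[:, drop_attrs] = 0.0                # drop only the attributes relevant to this task
    return torch.cat([z, b], dim=-1)      # representation for the downstream task
```

Different downstream tasks can pass different `drop_attrs`, which is what makes the fairness criterion adjustable after training.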
The authors introduce an objective that combines the following components (a minimal sketch follows the list):
- An encoder-decoder (VAE) framework, which keeps the learned representations faithful to the data while remaining flexible with respect to sensitive attributes.
- A factorized decoder over the data and the sensitive attributes, whose per-attribute terms encourage each sensitive latent dimension to be predictive of its attribute, paired with a disentanglement penalty that discourages dependence between the sensitive dimensions and the rest of the code.
- Hyperparameters that let practitioners trade off predictiveness against disentanglement according to the application's needs, giving a tunable approach to fairness.
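A minimal PyTorch-style sketch of an objective in this spirit is given below; the module sizes, the binary-attribute predictiveness term, and the FactorVAE-style discriminator used to estimate the disentanglement penalty are illustrative assumptions rather than the paper's exact implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FFVAESketch(nn.Module):
    """VAE whose latent code is split into non-sensitive z and one sensitive
    dimension per attribute (b)."""
    def __init__(self, x_dim, z_dim, n_attrs, hidden=128):
        super().__init__()
        latent = z_dim + n_attrs
        self.enc = nn.Sequential(nn.Linear(x_dim, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 2 * latent))
        self.dec = nn.Sequential(nn.Linear(latent, hidden), nn.ReLU(),
                                 nn.Linear(hidden, x_dim))
        self.z_dim = z_dim

    def forward(self, x):
        mu, logvar = self.enc(x).chunk(2, dim=-1)
        code = mu + torch.randn_like(mu) * (0.5 * logvar).exp()   # reparameterization
        z, b = code[:, :self.z_dim], code[:, self.z_dim:]
        return self.dec(code), mu, logvar, z, b

def ffvae_loss(model, disc, x, a, alpha=1.0, gamma=1.0):
    """One evaluation of the objective. `disc` is a FactorVAE-style discriminator
    (its own training, on dimension-permuted codes, is omitted) whose two logits
    provide a density-ratio estimate for the disentanglement penalty."""
    x_hat, mu, logvar, z, b = model(x)
    recon = F.mse_loss(x_hat, x, reduction='sum') / x.size(0)           # reconstruction
    kl = -0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(1).mean()   # prior KL
    pred = F.binary_cross_entropy_with_logits(b, a.float())             # predictiveness: b_j -> a_j
    logits = disc(torch.cat([z, b], dim=-1))
    tc = (logits[:, 0] - logits[:, 1]).mean()                           # total-correlation estimate
    return recon + kl + alpha * pred + gamma * tc
```

Here `alpha` weights how strongly each sensitive dimension must predict its attribute and `gamma` weights the disentanglement penalty, which is the trade-off referred to above.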
Empirical Evaluation
The effectiveness of FFVAE is demonstrated across multiple datasets:
- Synthetic Datasets: On variants of DSprites with correlations deliberately introduced among the attributes, FFVAE achieves subgroup fairness without sacrificing predictive accuracy.
- Communities and Crime Dataset: FFVAE handles multiple sensitive attributes, yielding better fairness-accuracy trade-offs than baseline approaches such as FactorVAE.
- Celebrity Faces (CelebA) Dataset: On this real-world image dataset, FFVAE remains competitive, showing that the approach scales to high-dimensional settings with complex attribute dependencies.
FFVAE outperforms standard disentanglement methods by supporting subgroup fairness expressed in terms of demographic parity. An extensive set of experiments indicates that the approach isolates sensitive attributes in the representation, improving the fairness of downstream predictions across task variations.
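For concreteness, one simple reading of subgroup demographic parity is the largest gap in positive-prediction rates across subgroups defined by conjunctions of binary sensitive attributes; the function below is an illustrative metric in that spirit, not necessarily the exact quantity reported in the paper:

```python
import itertools
import numpy as np

def subgroup_dp_gap(y_pred, attrs):
    """Largest gap in positive-prediction rates across attribute subgroups.

    y_pred: (n,) array of 0/1 predictions; attrs: (n, k) array of binary attributes.
    """
    rates = []
    for combo in itertools.product([0, 1], repeat=attrs.shape[1]):
        mask = np.all(attrs == np.array(combo), axis=1)
        if mask.any():                        # ignore empty subgroups
            rates.append(y_pred[mask].mean())
    return max(rates) - min(rates)
```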
Implications and Future Directions
The theoretical and empirical contributions of this paper have meaningful implications for the field of AI fairness: the proposed framework decouples the fairness criteria from initial model training, allowing post-hoc fairness adjustments at deployment time, a significant advantage given diverse operational contexts and legislative environments.
Future research avenues include exploring the application of different fairness constraints beyond demographic parity, such as equal opportunity or calibration-based metrics. Additionally, enhancing the method's applicability to other model architectures or integrating causal reasoning for improved representation invariance could further extend this research's impact.
In conclusion, this paper provides a significant step towards creating machine learning systems that can be dynamically adjusted for fairness across varied sensitive attributes, offering a versatile solution to a pressing societal issue.