Jury Learning: Integrating Dissenting Voices into Machine Learning Models (2202.02950v1)

Published 7 Feb 2022 in cs.HC, cs.AI, and cs.LG

Abstract: Whose labels should an ML algorithm learn to emulate? For ML tasks ranging from online comment toxicity to misinformation detection to medical diagnosis, different groups in society may have irreconcilable disagreements about ground truth labels. Supervised ML today resolves these label disagreements implicitly using majority vote, which overrides minority groups' labels. We introduce jury learning, a supervised ML approach that resolves these disagreements explicitly through the metaphor of a jury: defining which people or groups, in what proportion, determine the classifier's prediction. For example, a jury learning model for online toxicity might centrally feature women and Black jurors, who are commonly targets of online harassment. To enable jury learning, we contribute a deep learning architecture that models every annotator in a dataset, samples from annotators' models to populate the jury, then runs inference to classify. Our architecture enables juries that dynamically adapt their composition, explore counterfactuals, and visualize dissent.
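
The jury metaphor in the abstract (choose a composition, sample annotators to fill the seats, then aggregate their predicted labels) can be illustrated with a toy sketch. This is not the paper's architecture: the learned per-annotator models are replaced by hard-coded stand-in predictions, and the annotator ids, group names, function names, and the tie-breaking rule are all hypothetical choices made for illustration.

```python
import random
from collections import Counter

# Hypothetical stand-in for learned per-annotator models: each annotator id
# maps to (group, predicted label for ONE comment), with 1 = toxic, 0 = not.
ANNOTATOR_PREDICTIONS = {
    "a1": ("women", 1),
    "a2": ("women", 1),
    "a3": ("women", 0),
    "a4": ("men", 0),
    "a5": ("men", 0),
    "a6": ("men", 1),
}


def sample_jury(composition, pool, rng):
    """Fill each group's seats by sampling annotators without replacement.

    `composition` maps group name -> number of seats on the jury.
    """
    jury = []
    for group, seats in composition.items():
        members = sorted(a for a, (g, _) in pool.items() if g == group)
        if seats > len(members):
            raise ValueError(f"not enough annotators in group {group!r}")
        jury.extend(rng.sample(members, seats))
    return jury


def jury_verdict(jury, pool):
    """Return (verdict, vote counts) so that dissent stays visible.

    Assumption: simple majority vote, with ties resolved as toxic (1).
    """
    votes = Counter(pool[a][1] for a in jury)
    verdict = 1 if votes[1] >= votes[0] else 0
    return verdict, votes


def toxic_rate(composition, pool, n_trials=100, seed=0):
    """Resample the jury many times; return the fraction of toxic verdicts."""
    rng = random.Random(seed)
    verdicts = [
        jury_verdict(sample_jury(composition, pool, rng), pool)[0]
        for _ in range(n_trials)
    ]
    return sum(verdicts) / n_trials
```

Changing the `composition` argument (for example `{"women": 3, "men": 0}` versus `{"women": 0, "men": 3}`) is a small-scale analogue of the counterfactual juries the abstract mentions: the same comment can receive a different verdict depending on who sits on the jury.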

Citations (131)


