
Examining Gender and Race Bias in Two Hundred Sentiment Analysis Systems (1805.04508v1)

Published 11 May 2018 in cs.CL

Abstract: Automatic machine learning systems can inadvertently accentuate and perpetuate inappropriate human biases. Past work on examining inappropriate biases has largely focused on just individual systems. Further, there is no benchmark dataset for examining inappropriate biases in systems. Here for the first time, we present the Equity Evaluation Corpus (EEC), which consists of 8,640 English sentences carefully chosen to tease out biases towards certain races and genders. We use the dataset to examine 219 automatic sentiment analysis systems that took part in a recent shared task, SemEval-2018 Task 1 'Affect in Tweets'. We find that several of the systems show statistically significant bias; that is, they consistently provide slightly higher sentiment intensity predictions for one race or one gender. We make the EEC freely available.

Citations (413)

Summary

  • The paper demonstrates that 75–86% of sentiment systems exhibit statistically significant gender bias in predicting emotions.
  • The study employs the Equity Evaluation Corpus with 8,640 sentences to assess bias via paired comparisons with demographic identifiers.
  • The research reveals that systems showing no statistically significant bias often have lower overall prediction accuracy, highlighting a tension between predictive performance and fairness.

Analysis of Gender and Race Bias in Sentiment Analysis Systems

The paper "Examining Gender and Race Bias in Two Hundred Sentiment Analysis Systems" by Kiritchenko and Mohammad presents a comprehensive paper regarding the biases found in sentiment analysis systems. The authors introduce the Equity Evaluation Corpus (EEC), a benchmark dataset containing 8,640 carefully selected English sentences designed to reveal biases related to gender and race in sentiment analysis systems.

Overview

The paper examines 219 sentiment analysis systems that participated in SemEval-2018 Task 1, 'Affect in Tweets'. The primary focus is on identifying whether these systems produce different sentiment predictions depending on the race or gender of the people mentioned in the text.

Methodology

The researchers created the EEC from eleven sentence templates containing placeholders for a person term and, in many of them, an emotion or situational word. The person placeholders were filled with first names and noun phrases commonly associated with African American and European American identities and with female and male referents, while the emotion and situational words were chosen to reflect varying sentiment intensities.
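As a rough illustration of this template-filling procedure, the sketch below generates EEC-style sentences from two hypothetical templates; the templates, names, and emotion words are illustrative stand-ins, not the paper's actual lists.

```python
from itertools import product

# Hypothetical stand-ins for the paper's lists: the actual EEC uses eleven
# templates and larger, carefully chosen sets of names and noun phrases.
templates = [
    "{person} feels {emotion_word}.",
    "The conversation with {person} was {emotion_word}.",
]
person_terms = {
    "female": ["Amanda", "Ebony", "my sister"],   # illustrative only
    "male": ["Adam", "Jamel", "my brother"],      # illustrative only
}
emotion_words = {
    "anger": ["annoyed", "furious"],
    "joy": ["glad", "ecstatic"],
}

# Instantiate every combination of template, person term, and emotion word.
rows = []
for template, (gender, persons) in product(templates, person_terms.items()):
    for person, (emotion, words) in product(persons, emotion_words.items()):
        for word in words:
            rows.append({
                "sentence": template.format(person=person, emotion_word=word),
                "gender": gender,
                "emotion": emotion,
            })

for row in rows[:4]:
    print(row)
```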

The systems' outputs on these sentences were then analyzed for bias by comparing predictions on pairs of sentences that differ only in a gender- or race-associated word. Statistical significance tests were applied to determine whether the observed score differences were meaningful.
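The following is a minimal sketch of such a paired comparison, assuming one system's predicted intensity scores are already available for matched sentence pairs. The paired t-test is used here as one plausible choice of significance test, and the scores below are synthetic placeholders rather than real system output.

```python
import numpy as np
from scipy import stats

def paired_bias_test(scores_a, scores_b):
    """Compare one system's intensity predictions on sentence pairs that differ
    only in a gender- or race-associated word (group A vs. group B).

    Returns the mean score difference (A minus B) and a paired t-test p-value.
    """
    scores_a = np.asarray(scores_a, dtype=float)
    scores_b = np.asarray(scores_b, dtype=float)
    _statistic, p_value = stats.ttest_rel(scores_a, scores_b)
    return (scores_a - scores_b).mean(), p_value

# Synthetic placeholder scores standing in for one system's predictions on
# matched pairs; real scores would come from running the system on the EEC.
rng = np.random.default_rng(seed=0)
scores_female = rng.uniform(0.4, 0.6, size=100)
scores_male = scores_female - 0.01 + rng.normal(0.0, 0.005, size=100)

mean_diff, p = paired_bias_test(scores_female, scores_male)
print(f"mean difference = {mean_diff:+.4f}, p = {p:.3g}")
```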

Key Findings

  • Gender Bias: A large majority of the systems (75% to 86%, depending on the emotion) exhibited statistically significant score differences by gender, consistently assigning slightly higher sentiment intensity to sentences mentioning one gender over the other across the anger, fear, joy, and sadness tasks (a sketch of how such per-system results can be tallied appears after this list).
  • Race Bias: Score differences tied to race were even more prevalent, with systems assigning higher intensity to sentences containing African American or European American names to different degrees depending on the emotion.
  • Performance and Bias Correlation: Systems showing no statistically significant bias tended to perform worse at predicting sentiment intensity on the shared task's standard test sets.
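To illustrate how the per-system comparisons could be tallied across all 219 submissions, the sketch below counts systems by outcome. The input format is assumed from the `paired_bias_test` helper sketched earlier, and the example results are fabricated placeholders, not the paper's numbers.

```python
from collections import Counter

def summarize_systems(results, alpha=0.05):
    """Tally how many systems show a statistically significant score
    difference and in which direction.

    `results` maps a system name to (mean_difference, p_value), e.g. as
    produced per system by a paired comparison like paired_bias_test above.
    """
    tally = Counter()
    for _system, (mean_diff, p_value) in results.items():
        if p_value >= alpha:
            tally["no significant difference"] += 1
        elif mean_diff > 0:
            tally["group A scored higher"] += 1
        else:
            tally["group B scored higher"] += 1
    return tally

# Fabricated placeholder results for three systems (not the paper's numbers).
example_results = {
    "system_1": (0.012, 0.001),
    "system_2": (-0.004, 0.200),
    "system_3": (0.008, 0.030),
}
print(summarize_systems(example_results))
```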

Implications

The paper provides evidence of widespread bias in current sentiment analysis models, underscoring the risk of perpetuating inappropriate human biases in automated systems. These biases carry the potential to affect downstream applications such as recommendation systems, behavioral analysis, and customer service.

Future Directions

Further research needs to address the sources of these biases, which may originate from word embeddings, lexicons, or the datasets used for training. Additionally, examining the differential performance of systems on text influenced by varied demographic attributes remains an open area of research, with possible extensions of the current corpus to cover additional identities and biases.

The Equity Evaluation Corpus is a noteworthy contribution as a resource for analyzing and mitigating biases in NLP systems. The authors acknowledge, however, that measuring bias is complex and that the proposed dataset is only one part of the broader suite of tools needed for a thorough examination.

Conclusion

This investigation into the biases of sentiment analysis systems highlights critical issues that need addressing as these systems become more integrated into aspects of decision-making. The availability of the EEC allows further examination and potential mitigation steps to be undertaken by the research community to improve fairness in AI.
