Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models (2307.10522v1)

Published 20 Jul 2023 in cs.CL

Abstract: Recent studies have revealed that the widely-used Pre-trained LLMs (PLMs) propagate societal biases from the large unmoderated pre-training corpora. Existing solutions require debiasing training processes and datasets for debiasing, which are resource-intensive and costly. Furthermore, these methods hurt the PLMs' performance on downstream tasks. In this study, we propose Gender-tuning, which debiases the PLMs through fine-tuning on downstream tasks' datasets. For this aim, Gender-tuning integrates Masked LLMing (MLM) training objectives into fine-tuning's training process. Comprehensive experiments show that Gender-tuning outperforms the state-of-the-art baselines in terms of average gender bias scores in PLMs while improving PLMs' performance on downstream tasks solely using the downstream tasks' dataset. Also, Gender-tuning is a deployable debiasing tool for any PLM that works with original fine-tuning.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (5)

Somayeh Ghanbarzadeh (2 papers)
Yan Huang (180 papers)
Hamid Palangi (52 papers)
Radames Cruz Moreno (4 papers)
Hamed Khanpour (6 papers)

Citations (7)

View on Semantic Scholar

Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models (2307.10522v1)

Related Papers