Bias Mitigation in Fine-tuning Pre-trained Models for Enhanced Fairness and Efficiency (2403.00625v1)

Published 1 Mar 2024 in cs.LG and cs.CY

Abstract: Fine-tuning pre-trained models is a widely employed technique in numerous real-world applications. However, fine-tuning these models on new tasks can lead to unfair outcomes, because fairness properties carry no generalization guarantees, regardless of whether the original pre-trained model was developed with fairness considerations. To tackle this issue, we introduce an efficient and robust fine-tuning framework specifically designed to mitigate bias on new tasks. Our empirical analysis shows that different parameters of the pre-trained model drive predictions for different demographic groups. Based on this observation, we employ a transfer learning strategy that neutralizes the importance of these influential weights, which are identified using Fisher information computed across demographic groups. We further combine this weight-importance neutralization with a matrix factorization technique that replaces the weight matrix with a low-rank approximation using fewer parameters, reducing computational demands. Experiments on multiple pre-trained models and new tasks demonstrate the effectiveness of our method.
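The two ingredients the abstract describes, group-wise Fisher importance with neutralization and a low-rank re-parameterization, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`compute_group_fisher`, `neutralization_weights`, `low_rank_factorize`), the diagonal-Fisher estimate, and the specific neutralization formula are all assumptions filled in for exposition.

```python
# Illustrative sketch only; the paper's actual method may differ.
import torch
import torch.nn.functional as F

def compute_group_fisher(model, loader, group_id):
    # Diagonal Fisher estimate: average squared gradient of the loss,
    # accumulated over one demographic group's examples.
    fisher = {n: torch.zeros_like(p) for n, p in model.named_parameters()}
    n_batches = 0
    for x, y, g in loader:  # g holds each example's group label (assumed)
        sel = g == group_id
        if not sel.any():
            continue
        model.zero_grad()
        loss = F.cross_entropy(model(x[sel]), y[sel])
        loss.backward()
        for n, p in model.named_parameters():
            if p.grad is not None:
                fisher[n] += p.grad.detach() ** 2
        n_batches += 1
    return {n: f / max(n_batches, 1) for n, f in fisher.items()}

def neutralization_weights(fisher_a, fisher_b, eps=1e-8):
    # Shrink the update for parameters whose importance differs most
    # between two groups, so fine-tuning relies less on weights that
    # are influential for only one group. (Formula is an assumption.)
    out = {}
    for n in fisher_a:
        gap = (fisher_a[n] - fisher_b[n]).abs()
        out[n] = 1.0 / (1.0 + gap / (fisher_a[n] + fisher_b[n] + eps))
    return out

def low_rank_factorize(weight, r):
    # Rank-r approximation W ~= A @ B via truncated SVD; the two factors
    # hold r * (m + n) parameters instead of m * n for an m x n matrix.
    U, S, Vt = torch.linalg.svd(weight, full_matrices=False)
    return U[:, :r] * S[:r], Vt[:r, :]
```

A fine-tuning loop would then scale each parameter's gradient by its neutralization weight and train the low-rank factors in place of the dense matrix; whether the paper applies the mask to gradients, to a regularizer, or through some other mechanism is not specified in the abstract.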
