Dual-Balancing for Multi-Task Learning (2308.12029v2)
Abstract: Multi-task learning (MTL), a learning paradigm to learn multiple related tasks simultaneously, has achieved great success in various fields. However, task balancing problem remains a significant challenge in MTL, with the disparity in loss/gradient scales often leading to performance compromises. In this paper, we propose a Dual-Balancing Multi-Task Learning (DB-MTL) method to alleviate the task balancing problem from both loss and gradient perspectives. Specifically, DB-MTL ensures loss-scale balancing by performing a logarithm transformation on each task loss, and guarantees gradient-magnitude balancing via normalizing all task gradients to the same magnitude as the maximum gradient norm. Extensive experiments conducted on several benchmark datasets consistently demonstrate the state-of-the-art performance of DB-MTL.
- Baijiong Lin (15 papers)
- Weisen Jiang (15 papers)
- Feiyang Ye (17 papers)
- Yu Zhang (1400 papers)
- Pengguang Chen (20 papers)
- Ying-Cong Chen (47 papers)
- Shu Liu (146 papers)
- James T. Kwok (65 papers)