Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits (2309.09832v2)
Abstract: Multi-task learning (MTL) aims to improve the performance of a primary task by jointly learning with related auxiliary tasks. Traditional MTL methods select tasks randomly during training. However, both previous studies and our results suggest that such a random selection of tasks may not be helpful, and can even be harmful to performance. Therefore, new strategies for task selection and assignment in MTL need to be explored. This paper studies the multi-modal, multi-task dialogue act classification task, and proposes a method for selecting and assigning tasks based on non-stationary multi-armed bandits (MAB) with discounted Thompson Sampling (TS) using Gaussian priors. Our experimental results show that in different training stages, different tasks have different utility. Our proposed method can effectively identify the task utility, actively avoid useless or harmful tasks, and realise the task assignment during training. Our proposed method is significantly superior in terms of UAR and F1 to the single-task and multi-task baselines with p-values < 0.05. Further analysis of experiments indicates that for the dataset with the data imbalance problem, our proposed method has significantly higher stability and can obtain consistent and decent performance for minority classes. Our proposed method is superior to the current state-of-the-art model.
- Rich Caruana, “Multitask learning,” in Learning to Learn, Sebastian Thrun and Lorien Y. Pratt, Eds., pp. 95–133. Springer, 1998.
- “Attention-augmented end-to-end multi-task learning for emotion prediction from speech,” in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019, pp. 6705–6709.
- “End-to-end multi-task learning with attention,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 1871–1880.
- “Loss-balanced task weighting to reduce negative transfer in multi-task learning,” in Proceedings of the AAAI conference on artificial intelligence, 2019, vol. 33, pp. 9977–9978.
- “Asymmetric multi-task learning based on task relatedness and loss,” in International conference on machine learning. PMLR, 2016, pp. 230–238.
- “Incorporating discourse features into confidence scoring of intention recognition results in spoken dialogue systems,” Speech Communication, vol. 48, no. 3-4, pp. 417–436, 2006.
- “Dialogue generation in character-based interactive storytelling,” in Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2005, vol. 1, pp. 21–26.
- “Towards an open-domain conversational system fully based on natural language processing,” in Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, 2014, pp. 928–939.
- “Date: a dialogue act tagging scheme for evaluation of spoken dialogue systems,” in Proceedings of the first international conference on Human language technology research, 2001.
- “Dialogue act classification in domain-independent conversations using a deep recurrent neural network,” in Proceedings of coling 2016, the 26th international conference on computational linguistics: Technical papers, 2016, pp. 2012–2021.
- “Towards emotion-aided multi-modal dialogue act classification,” in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020, pp. 4361–4372.
- “Emotion aided dialogue act classification for task-independent conversations in a multi-modal framework,” Cognitive Computation, vol. 13, pp. 277–289, 2021.
- “Autosem: Automatic task selection and mixing in multi-task learning,” in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short Papers), Jill Burstein, Christy Doran, and Thamar Solorio, Eds. 2019, pp. 3520–3531, Association for Computational Linguistics.
- “Further optimal regret bounds for thompson sampling,” in Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2013, Scottsdale, AZ, USA, April 29 - May 1, 2013. 2013, vol. 31 of JMLR Workshop and Conference Proceedings, pp. 99–107, JMLR.org.
- “Discounted thompson sampling for non-stationary bandit problems,” CoRR, vol. abs/2305.10718, 2023.
- “IEMOCAP: interactive emotional dyadic motion capture database,” Lang. Resour. Evaluation, vol. 42, no. 4, pp. 335–359, 2008.
- “A multi-task learning framework for emotion recognition using 2d continuous space,” IEEE Transactions on affective computing, vol. 8, no. 1, pp. 3–14, 2015.
- Bernard L Welch, “The generalization of ‘student’s’problem when several different population variances are involved,” Biometrika, vol. 34, no. 1-2, pp. 28–35, 1947.
- Xiangheng He (8 papers)
- Junjie Chen (89 papers)
- Björn W. Schuller (153 papers)