Improving Personalisation in Valence and Arousal Prediction using Data Augmentation (2404.09042v1)
Abstract: In emotion recognition and Human-Machine Interaction (HMI), personalised approaches have proven effective at capturing individual-specific characteristics and improving affective prediction accuracy. However, personalisation techniques often face the challenge of limited data for target individuals. This paper presents an enhanced personalisation strategy that leverages data augmentation to develop tailored models for continuous valence and arousal prediction. Our proposed approach, Distance Weighting Augmentation (DWA), employs a weighting-based augmentation method that expands a target individual's dataset, using distance metrics to identify similar samples at the segment level. Experimental results on the MuSe-Personalisation 2023 Challenge dataset demonstrate that our method significantly improves test-set performance for feature sets with low baseline performance. This improvement on poorly performing features comes without sacrificing performance on high-performing features. In particular, our method achieves a maximum combined test CCC of 0.78, compared to the reported baseline score of 0.76 (reproduced at 0.72). It also achieves peak arousal and valence scores of 0.81 and 0.76, compared to reproduced baseline scores of 0.76 and 0.67, respectively. Through this work, we contribute to the advancement of personalised affective computing models, enhancing the practicality and adaptability of data-level personalisation in real-world contexts.
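The scores quoted in the abstract are CCC values, i.e. the concordance correlation coefficient, which rewards predictions that both correlate with the gold annotations and match them in mean and scale. As a reading aid, the following is a minimal sketch of the standard CCC definition in plain Python. The function name `ccc`, the use of population (not sample) statistics, and the handling of a zero denominator are assumptions made for illustration; this is not the authors' or the challenge's evaluation code.

```python
from statistics import fmean


def ccc(pred, gold):
    """Concordance correlation coefficient between two equal-length sequences.

    Standard definition with population statistics:
        2 * cov(pred, gold) / (var(pred) + var(gold) + (mean_pred - mean_gold)^2)
    """
    pred = list(pred)
    gold = list(gold)
    if len(pred) != len(gold) or not pred:
        raise ValueError("pred and gold must be non-empty and of equal length")

    mean_p = fmean(pred)
    mean_g = fmean(gold)

    # Population variance and covariance (divide by n, not n - 1).
    var_p = fmean([(p - mean_p) ** 2 for p in pred])
    var_g = fmean([(g - mean_g) ** 2 for g in gold])
    cov = fmean([(p - mean_p) * (g - mean_g) for p, g in zip(pred, gold)])

    denom = var_p + var_g + (mean_p - mean_g) ** 2
    if denom == 0:
        # Both sequences are constant and identical; the coefficient is
        # undefined here. Returning 0.0 is an assumption of this sketch.
        return 0.0
    return 2 * cov / denom


if __name__ == "__main__":
    gold = [1.0, 2.0, 3.0]
    print(ccc([1.0, 2.0, 3.0], gold))  # identical sequences
    print(ccc([2.0, 3.0, 4.0], gold))  # perfectly correlated but shifted by 1
    print(ccc([3.0, 2.0, 1.0], gold))  # perfectly anti-correlated
```

Unlike Pearson correlation, the shifted prediction in the second call scores below 1 (4/7 under this definition), because the squared difference of the means enters the denominator.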