Multi-Branch Network for Imagery Emotion Prediction (2312.07500v1)
Abstract: For a long time, images have proved perfect at both storing and conveying rich semantics, especially human emotions. A lot of research has been conducted to provide machines with the ability to recognize emotions in photos of people. Previous methods mostly focus on facial expressions but fail to consider the scene context, meanwhile scene context plays an important role in predicting emotions, leading to more accurate results. In addition, Valence-Arousal-Dominance (VAD) values offer a more precise quantitative understanding of continuous emotions, yet there has been less emphasis on predicting them compared to discrete emotional categories. In this paper, we present a novel Multi-Branch Network (MBN), which utilizes various source information, including faces, bodies, and scene contexts to predict both discrete and continuous emotions in an image. Experimental results on EMOTIC dataset, which contains large-scale images of people in unconstrained situations labeled with 26 discrete categories of emotions and VAD values, show that our proposed method significantly outperforms state-of-the-art methods with 28.4% in mAP and 0.93 in MAE. The results highlight the importance of utilizing multiple contextual information in emotion prediction and illustrate the potential of our proposed method in a wide range of applications, such as effective computing, human-computer interaction, and social robotics. Source code: https://github.com/BaoNinh2808/Multi-Branch-Network-for-Imagery-Emotion-Prediction
- balmukund. 2022. FER-2013 pytorch implementation. https://www.kaggle.com/code/balmukund/fer-2013-pytorch-implementation?fbclid=IwAR3xaZrtY7-RDZiXHGcjf6ytJ5Nk4wMDxGhsQs0pg2R0ul7GNv7lgS3ePI8
- Lisa Feldman Barrett. 2017. How emotions are made: The secret life of the brain. Pan Macmillan.
- Emotional expressions reconsidered: Challenges to inferring emotion from human facial movements. Psychological Science in the Public Interest 17, 1 (2016), 1–68.
- Context in emotion perception. Current directions in psychological science 20, 5 (2011), 286–290.
- Associating facial expressions and upper-body gestures with learning tasks for enhancing intelligent tutoring systems. International Journal of Artificial Intelligence in Education 30 (2020), 236–270.
- Google AI Blog. 2021. Understanding Contextual Facial Expressions Across the Globe. https://ai.googleblog.com/2021/05/understanding-contextual-facial.html
- Alan S Cowen and Dacher Keltner. 2017. Self-report captures 27 distinct categories of emotion bridged by continuous gradients. Proceedings of the national academy of sciences 114, 38 (2017), E7900–E7909.
- Joel R Davitz. 1964. Expression of Emotion in Man and Animals. New York: McGraw-Hill (1964).
- Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248–255.
- S. Dixon. 2023. Number of global social network users 2017-2027. https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/
- A survey on deep learning and its applications. Computer Science Review 40 (2021), 100379.
- Universals and cultural differences in the judgments of facial expressions of emotion. Journal of Personality and Social Psychology 53, 4 (1987), 712–723. https://doi.org/10.1037/0022-3514.53.4.712
- Google. 2021. Face Detection. https://colab.research.google.com/github/dortmans/ml_notebooks/blob/master/face_detection.ipynb. Accessed: 2023-12-09.
- Artificial intelligence (AI) applications for marketing: A literature-based study. International Journal of Intelligent Networks (2022).
- On the importance of both dimensional and discrete models of emotion. Behavioral Sciences 7, 4 (2017), 66. https://doi.org/10.3390/bs7040066
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
- The neural representation of visually evoked emotion is high-dimensional, categorical, and distributed across transmodal brain regions. Iscience 23, 5 (2020), 101060.
- Attention-based multimodal contextual fusion for sentiment and emotion classification using bidirectional LSTM. Multimedia Tools and Applications 80, 9 (2021), 13059–13076.
- EMOTIC: Emotions in Context dataset. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 61–69.
- Context Based Emotion Recognition Using EMOTIC Dataset. IEEE Transactions on Pattern Analysis and Machine Intelligence 42, 11 (2020), 2755–2766. https://doi.org/10.1109/TPAMI.2019.2916866
- Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision. 10012–10022.
- Facial expression recognition with visual transformers and attentional selective fusion. IEEE Transactions on Affective Computing (2021).
- Survey on AI-Based Multimodal Methods for Emotion Detection. High-performance modelling and simulation for big data applications 11400 (2019), 307–324.
- Emotion Classification Based on Biophysical Signals and Machine Learning Techniques. Symmetry 12 (12 2019), 21. https://doi.org/10.3390/sym12010021
- EmotiCon: Context-Aware Multimodal Emotion Recognition using Frege’s Principle. arXiv:2003.06692 [cs.AI]
- ARVIND NARAYANAN. 2023. Understanding Social Media Recommendation Algorithms. https://knightcolumbia.org/content/understanding-social-media-recommendation-algorithms
- Survey on emotional body gesture recognition. IEEE transactions on affective computing 12, 2 (2018), 505–523.
- Facial Expression Recognition 2013 Dataset. https://www.kaggle.com/datasets/msambare/fer2013
- Andrey V Savchenko. 2021. Facial expression and attributes recognition based on multi-task learning of lightweight neural networks. In 2021 IEEE 19th International Symposium on Intelligent Systems and Informatics (SISY). IEEE, 119–124.
- Recognizing emotions expressed by body pose: A biologically inspired neural model. Neural networks 21, 9 (2008), 1238–1246.
- Akash Shangeth. 2022. Facial Emotion Recognition PyTorch ONNX. https://github.com/shangeth/Facial-Emotion-Recognition-PyTorch-ONNX
- Roza Tsvetkova. 2023. 99 Amazing Social Media Statistics and Facts. https://www.brandwatch.com/blog/amazing-social-media-statistics-and-facts/
- Distract your attention: multi-head cross attention network for facial expression recognition. Biomimetics 8, 2 (2023), 199.
- Facial expression megamix: Tests of dimensional and category accounts of emotion recognition. Cognition 63, 3 (1997), 271–313.
- Places: A 10 million Image Database for Scene Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (2017).
- Quoc-Bao Ninh (1 paper)
- Hai-Chan Nguyen (1 paper)
- Triet Huynh (1 paper)
- Trung-Nghia Le (42 papers)