
A Multimodal Intermediate Fusion Network with Manifold Learning for Stress Detection (2403.08077v1)

Published 12 Mar 2024 in cs.CV and cs.AI

Abstract: Multimodal deep learning methods capture synergistic features from multiple modalities and have the potential to improve accuracy for stress detection compared to unimodal methods. However, this accuracy gain typically comes at a high computational cost due to the high-dimensional feature spaces, especially for intermediate fusion. Dimensionality reduction is one way to optimize multimodal learning by simplifying the data and making the features more amenable to processing and analysis, thereby reducing computational complexity. This paper introduces an intermediate multimodal fusion network with manifold learning-based dimensionality reduction. The multimodal network generates independent representations from biometric signals and facial landmarks through a 1D-CNN and a 2D-CNN. These features are then fused and fed to another 1D-CNN layer, followed by a fully connected dense layer. We compared various dimensionality reduction techniques for different variations of unimodal and multimodal networks. We observe that intermediate-level fusion with the Multi-Dimensional Scaling (MDS) manifold method showed promising results, with an accuracy of 96.00% in a Leave-One-Subject-Out Cross-Validation (LOSO-CV) paradigm, outperforming the other dimensionality reduction methods. MDS had the highest computational cost among the manifold learning methods. Nevertheless, while outperforming the other networks, it reduced the computational cost of the proposed networks by 25% compared with six well-known conventional feature selection methods used in the preprocessing step.
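The Leave-One-Subject-Out Cross-Validation (LOSO-CV) paradigm mentioned in the abstract can be illustrated with a minimal sketch. It holds out all samples of one subject per fold and trains on the remaining subjects. The function names (`loso_splits`, `loso_accuracy`), the toy data, and the majority-class stand-in classifier are hypothetical and are not taken from the paper; averaging accuracy per fold is also an assumption, since the abstract does not say how the reported accuracy is aggregated.

```python
from collections import Counter, defaultdict
from typing import Callable, Dict, Iterator, List, Sequence, Tuple


def loso_splits(subject_ids: Sequence[str]) -> Iterator[Tuple[str, List[int], List[int]]]:
    """Yield (held_out_subject, train_indices, test_indices), one fold per subject."""
    by_subject: Dict[str, List[int]] = defaultdict(list)
    for idx, sid in enumerate(subject_ids):
        by_subject[sid].append(idx)
    for held_out in sorted(by_subject):
        test_idx = by_subject[held_out]
        train_idx = [i for i, sid in enumerate(subject_ids) if sid != held_out]
        yield held_out, train_idx, test_idx


def loso_accuracy(
    features: Sequence,
    labels: Sequence[int],
    subject_ids: Sequence[str],
    fit: Callable,
    predict: Callable,
) -> float:
    """Mean per-fold accuracy (assumption: folds are weighted equally)."""
    fold_acc = []
    for _, train_idx, test_idx in loso_splits(subject_ids):
        # Train only on subjects other than the held-out one.
        model = fit([features[i] for i in train_idx], [labels[i] for i in train_idx])
        preds = predict(model, [features[i] for i in test_idx])
        correct = sum(p == labels[i] for p, i in zip(preds, test_idx))
        fold_acc.append(correct / len(test_idx))
    return sum(fold_acc) / len(fold_acc)


# Hypothetical stand-in classifier: always predicts the majority training label.
def fit_majority(x_train, y_train):
    return Counter(y_train).most_common(1)[0][0]


def predict_majority(model, x_test):
    return [model for _ in x_test]


# Toy data: five samples from three subjects (not from the paper's dataset).
toy_features = [[0.1], [0.2], [0.9], [0.3], [0.4]]
toy_labels = [1, 1, 0, 1, 1]
toy_subjects = ["s1", "s1", "s2", "s2", "s3"]

folds = list(loso_splits(toy_subjects))
accuracy = loso_accuracy(toy_features, toy_labels, toy_subjects, fit_majority, predict_majority)
```

Because every fold tests on a subject the model never saw during training, this protocol measures subject-independent performance. Any preprocessing that learns from the data would ideally be fitted inside each fold to avoid leakage; whether the paper does so is not stated in the abstract.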
