Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

How Suboptimal is Training rPPG Models with Videos and Targets from Different Body Sites? (2403.10582v1)

Published 15 Mar 2024 in eess.IV and cs.LG

Abstract: Remote camera measurement of the blood volume pulse via photoplethysmography (rPPG) is a compelling technology for scalable, low-cost, and accessible assessment of cardiovascular information. Neural networks currently provide the state-of-the-art for this task and supervised training or fine-tuning is an important step in creating these models. However, most current models are trained on facial videos using contact PPG measurements from the fingertip as targets/ labels. One of the reasons for this is that few public datasets to date have incorporated contact PPG measurements from the face. Yet there is copious evidence that the PPG signals at different sites on the body have very different morphological features. Is training a facial video rPPG model using contact measurements from another site on the body suboptimal? Using a recently released unique dataset with synchronized contact PPG and video measurements from both the hand and face, we can provide precise and quantitative answers to this question. We obtain up to 40 % lower mean squared errors between the waveforms of the predicted and the ground truth PPG signals using state-of-the-art neural models when using PPG signals from the forehead compared to using PPG signals from the fingertip. We also show qualitatively that the neural models learn to predict the morphology of the ground truth PPG signal better when trained on the forehead PPG signals. However, while models trained from the forehead PPG produce a more faithful waveform, models trained from a finger PPG do still learn the dominant frequency (i.e., the heart rate) well.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (41)
  1. Towards contactless estimation of electrodermal activity correlates. In 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), pages 1799–1802. IEEE, 2020.
  2. Near-infrared ccd imaging: Possibilities for noninvasive and contactless 2d mapping of dermal venous hemodynamics. In Optical Diagnostics of Biological Fluids V, pages 2–9. International Society for Optics and Photonics, 2000.
  3. Unsupervised skin tissue segmentation for remote photoplethysmography. Pattern Recognition Letters, 124:82–90, 2019.
  4. Estimation of blood pressure waveform from facial video using a deep u-shaped network and the wavelet representation of imaging photoplethysmographic signals. Biomedical Signal Processing and Control, 78:103895, 2022.
  5. Skin thickness changes in normal aging skin. Gerontology, 36(1):28–35, 1990.
  6. Video-based sympathetic arousal assessment via peripheral blood flow estimation. Biomedical Optics Express, 14(12):6607–6628, 2023.
  7. Deepphys: Video-based physiological measurement using convolutional attention networks. In Proceedings of the European Conference on Computer Vision (ECCV), pages 349–365, 2018.
  8. Camera-based remote photoplethysmography to measure heart rate and blood pressure in ambulatory patients with cardiovascular disease: Preliminary analysis. Journal of the American College of Cardiology, 81(8_Supplement):2301–2301, 2023.
  9. Recovering pulse rate during motion artifact with a multi-imager array for non-contact imaging photoplethysmography. In Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on, pages 1462–1469. IEEE, 2014.
  10. Thomas B Fitzpatrick. The validity and practicality of sun-reactive skin types i through vi. Archives of dermatology, 124(6):869–871, 1988.
  11. The way to my heart is through contrastive learning: Remote photoplethysmography from unlabelled video. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 3995–4004, 2021.
  12. Finger and forehead ppg signal comparison for respiratory rate estimation based on pulse amplitude variability. In 2017 25th European Signal Processing Conference (EUSIPCO), pages 2076–2080. IEEE, 2017.
  13. Finger and forehead ppg signal comparison for respiratory rate estimation. Physiological measurement, 40(9):095007, 2019.
  14. Introducing contactless blood pressure assessment using a high speed video camera. Journal of medical systems, 40(4):77, 2016.
  15. Assessment of roi selection for facial video-based rppg. Sensors, 21(23):7923, 2021.
  16. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  17. Lstc-rppg: Long short-term convolutional network for remote photoplethysmography. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6014–6022, 2023.
  18. Multi-task temporal shift attention networks for on-device contactless vitals measurement. NeurIPS, 2020.
  19. rppg-toolbox: Deep remote ppg toolbox. Advances in Neural Information Processing Systems, 36, 2024.
  20. Daniel McDuff. Camera measurement of physiological vital signs. ACM Computing Surveys, 55(9):1–40, 2023.
  21. Non-contact imaging of peripheral hemodynamics during cognitive and psychological stressors. Scientific Reports, 10(1):1–13, 2020.
  22. Ubfc-phys: A multimodal database for psychophysiological studies of social stress. IEEE Transactions on Affective Computing, 2021.
  23. Combined photoplethysmographic monitoring of respiration rate and pulse: a comparison between different measurement sites in spontaneously breathing subjects. Acta Anaesthesiologica Scandinavica, 51(9):1250–1257, 2007.
  24. Full-body cardiovascular sensing with remote photoplethysmography. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5993–6003, 2023.
  25. Vipl-hr: A multi-modal database for pulse estimation from less-constrained face video. In Computer Vision–ACCV 2018: 14th Asian Conference on Computer Vision, Perth, Australia, December 2–6, 2018, Revised Selected Papers, Part V 14, pages 562–576. Springer, 2019.
  26. Advancements in noncontact, multiparameter physiological measurements using a webcam. IEEE transactions on biomedical engineering, 58(1):7–11, 2010a.
  27. Non-contact, automated cardiac pulse measurements using video imaging and blind source separation. Optics express, 18(10):10762–10774, 2010b.
  28. Assessment of non-invasive blood pressure prediction from ppg and rppg signals using deep learning. Sensors, 21(18):6022, 2021.
  29. Perinasal imaging of physiological stress and its affective potential. IEEE Transactions on Affective Computing, 3(3):366–378, 2012.
  30. Super-convergence: Very fast training of neural networks using large learning rates. In Artificial intelligence and machine learning for multi-domain operations applications, pages 369–386. SPIE, 2019.
  31. Visual heart rate estimation with convolutional neural network. In Proceedings of the british machine vision conference, Newcastle, UK, pages 3–6, 2018.
  32. Operational definition of normal sinus heart rate. The American journal of cardiology, 69(14):1245–1246, 1992.
  33. Non-contact video-based pulse rate measurement on a mobile service robot. In The 23rd IEEE International Symposium on Robot and Human Interactive Communication, pages 1056–1062. IEEE, 2014.
  34. Heart rate measurement based on a time-lapse image. Medical engineering & physics, 29(8):853–857, 2007.
  35. Non-contact video-based vital sign monitoring using ambient light and auto-regressive models. Physiological measurement, 35(5):807, 2014.
  36. Remote plethysmographic imaging using ambient light. Optics express, 16(26):21434–21445, 2008.
  37. Algorithmic principles of remote ppg. IEEE Transactions on Biomedical Engineering, 64(7):1479–1491, 2017.
  38. Simper: Simple self-supervised learning of periodic targets. arXiv preprint arXiv:2210.03115, 2022.
  39. Remote heart rate measurement from highly compressed facial videos: an end-to-end deep learning solution with video enhancement. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 151–160, 2019.
  40. Physformer: Facial video-based physiological measurement with temporal difference transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 4186–4196, 2022.
  41. Multimodal spontaneous emotion corpus for human behavior analysis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3438–3446, 2016.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Björn Braun (4 papers)
  2. Daniel McDuff (88 papers)
  3. Christian Holz (34 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.