Comparing Self-Supervised Learning Techniques for Wearable Human Activity Recognition (2404.15331v1)
Abstract: Human Activity Recognition (HAR) based on the sensors of mobile/wearable devices aims to detect the physical activities performed by humans in their daily lives. Although supervised learning methods are the most effective in this task, their effectiveness is constrained to using a large amount of labeled data during training. While collecting raw unlabeled data can be relatively easy, annotating data is challenging due to costs, intrusiveness, and time constraints. To address these challenges, this paper explores alternative approaches for accurate HAR using a limited amount of labeled data. In particular, we have adapted recent Self-Supervised Learning (SSL) algorithms to the HAR domain and compared their effectiveness. We investigate three state-of-the-art SSL techniques of different families: contrastive, generative, and predictive. Additionally, we evaluate the impact of the underlying neural network on the recognition rate by comparing state-of-the-art CNN and transformer architectures. Our results show that a Masked Auto Encoder (MAE) approach significantly outperforms other SSL approaches, including SimCLR, commonly considered one of the best-performing SSL methods in the HAR domain. The code and the pre-trained SSL models are publicly available for further research and development.
- Sensor-based activity recognition. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 42(6):790–808, 2012.
- Deep learning for sensor-based activity recognition: A survey. Pattern recognition letters, 119:3–11, 2019.
- Cross-dataset activity recognition via adaptive spatial-temporal transfer learning. in ACM IMWUT, 3(4):1–25, 2019.
- Pre-trained models: Past, present and future. AI Open, 2:225–250, 2021.
- A survey on semi-supervised learning. Machine learning, 109(2):373–440, 2020.
- Assessing the state of self-supervised human activity recognition using wearables. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 6(3):1–47, 2022.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607. PMLR, 2020.
- Hard negative mixing for contrastive learning. Advances in Neural Information Processing Systems, 33:21798–21809, 2020.
- Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16000–16009, 2022.
- Data2vec: A general framework for self-supervised learning in speech, vision and language. In Int. Conference on Machine Learning, pages 1298–1312. PMLR, 2022.
- Combining public human activity recognition datasets to mitigate labeled data scarcity, 2023.
- Literature review on transfer learning for human activity recognition using mobile and wearable devices with environmental technology. SN Computer Science, 1:1–16, 2020.
- Transfer learning approach for human activity recognition based on continuous wavelet transform. Algorithms, 16(2):77, 2023.
- Exploring semi-supervised and active learning for activity recognition. In 2008 12th IEEE International Symposium on Wearable Computers, pages 81–88, 2008.
- Semi-supervised and personalized federated activity recognition based on active learning and label propagation. Personal and Ubiquitous Computing, 26(5):1281–1298, 2022.
- Selfhar: Improving human activity recognition through self-training with unlabeled data. Proceedings of the ACM on interactive, mobile, wearable and ubiquitous technologies, 5(1):1–30, 2021.
- Multi-graph based semi-supervised learning for activity recognition. In 2009 international symposium on wearable computers, pages 85–92. IEEE, 2009.
- Activity recognition based on semi-supervised learning. In 13th IEEE international conference on embedded and real-time computing systems and applications (RTCSA 2007), pages 469–475. IEEE, 2007.
- Mang Hong Chan and Mohd Halim Mohd Noor. A unified generative model using generative adversarial network for activity recognition. Journal of Ambient Intelligence and Humanized Computing, 12:8119–8128, 2021.
- Self-supervised speech representation learning: A review. IEEE Journal of Selected Topics in Signal Processing, 2022.
- Bert: Pre-training of deep bidirectional transformers for language understanding. In North American Chapter of the Association for Computational Linguistics, 2019.
- Self-supervised representation learning: Introduction, advances, and challenges. IEEE Signal Processing Magazine, 39(3):42–62, 2022.
- Contrastive self-supervised learning for sensor-based human activity recognition. In 2021 IEEE International Joint Conference on Biometrics (IJCB), pages 1–8. IEEE, 2021.
- Sensor data augmentation by resampling in contrastive learning for human activity recognition. IEEE Sensors Journal, 22(23):22994–23008, 2022.
- Collossl: Collaborative self-supervised learning for human activity recognition. Proc. of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 6(1), 2022.
- Federated self-supervised learning of multisensor representations for embedded intelligence. IEEE Internet of Things Journal, 8(2):1030–1040, 2020.
- Selfact: Personalized activity recognition based on self-supervised and active learning. arXiv preprint arXiv:2304.09530, 2023.
- Masked reconstruction based self-supervision for human activity recognition. In Proceedings of the 2020 ACM International Symposium on Wearable Computers, pages 45–49, 2020.
- Limu-bert: Unleashing the potential of unlabeled data for imu sensing applications. In Proc. of the 19th ACM Conference on Embedded Networked Sensor Systems, pages 220–233, 2021.
- Cross reconstruction transformer for self-supervised time series representation learning. arXiv preprint arXiv:2205.09928, 2022.
- Contrastive predictive coding for human activity recognition. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 5(2):1–26, 2021.
- isplinception: An inception-resnet deep learning architecture for human activity recognition. IEEE Access, 9, 2021.
- Transformer-based models to deal with heterogeneous environments in human activity recognition. Personal and Ubiquitous Computing, pages 1–14, 2023.
- Exploring contrastive learning in human activity recognition for healthcare. arXiv preprint arXiv:2011.11542, 2020.
- Capture-24: Activity tracker dataset for human activity recognition. 2021.
- Large scale population assessment of physical activity using wrist worn accelerometers: the uk biobank study. PloS one, 12(2):e0169649, 2017.
- Smart devices are different: Assessing and mitigating mobile sensing heterogeneities for activity recognition. In Proceedings of the 13th ACM Conference on Embedded Networked Sensor Systems, page 127–140, New York, NY, USA, 2015.
- The mobiact dataset: Recognition of activities of daily living using smartphones. In ICT4AgeingWell, pages 143–151, 2016.
- Protecting sensory data against sensitive inferences. In Proceedings of the 1st Workshop on Privacy by Design in Distributed Systems, W-P2DS’18, pages 2:1–2:6, New York, NY, USA, 2018. ACM.
- T. Sztyler and H. Stuckenschmidt. On-body localization of wearable devices: An investigation of position-aware activity recognition. In 2016 IEEE Int. Conference on Pervasive Computing and Communications, pages 1–9, 2016.
- A public domain dataset for human activity recognition using smartphones. In 21st European Symposium on Artificial Neural Networks, ESANN 2013, Bruges, Belgium, April 24-26, 2013, 2013.
- Introducing a new benchmarked dataset for activity monitoring. In 2012 16th International Symposium on Wearable Computers, pages 108–109. IEEE, 2012.
- Human activity recognition using inertial sensors in a smartphone: An overview. Sensors, 19:3213, 07 2019.
- Andrey Ignatov. Real-time human activity recognition from accelerometer data using convolutional neural networks. Applied Soft Computing, 62, 2018.
- Self-supervised learning from images with a joint-embedding predictive architecture. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15619–15629, 2023.
- Sannara Ek (7 papers)
- Riccardo Presotto (4 papers)
- Gabriele Civitarese (15 papers)
- François Portet (29 papers)
- Philippe Lalanda (13 papers)
- Claudio Bettini (18 papers)