Introducing SSBD+ Dataset with a Convolutional Pipeline for detecting Self-Stimulatory Behaviours in Children using raw videos (2311.15072v1)
Abstract: Conventionally, evaluation for the diagnosis of Autism spectrum disorder is done by a trained specialist through questionnaire-based formal assessments and by observation of behavioral cues under various settings to capture the early warning signs of autism. These evaluation techniques are highly subjective and their accuracy relies on the experience of the specialist. In this regard, machine learning-based methods for automated capturing of early signs of autism from the recorded videos of the children is a promising alternative. In this paper, the authors propose a novel pipelined deep learning architecture to detect certain self-stimulatory behaviors that help in the diagnosis of autism spectrum disorder (ASD). The authors also supplement their tool with an augmented version of the Self Stimulatory Behavior Dataset (SSBD) and also propose a new label in SSBD Action detection: no-class. The deep learning model with the new dataset is made freely available for easy adoption to the researchers and developers community. An overall accuracy of around 81% was achieved from the proposed pipeline model that is targeted for real-time and hands-free automated diagnosis. All of the source code, data, licenses of use, and other relevant material is made freely available in https://github.com/sarl-iiitb/
- https://github.com/sarl-iiitb.
- C. Lord, M. Elsabbagh, G. Baird, and J. Veenstra-Vanderweele, “Autism spectrum disorder,” The lancet, vol. 392, no. 10146, pp. 508–520, 2018.
- R. Masiran, “Stimming behaviour in a 4-year-old girl with autism spectrum disorder,” Case Reports, vol. 2018, pp. bcr–2017, 2018.
- I. P. Oono, E. J. Honey, and H. McConachie, “Parent-mediated early intervention for young children with autism spectrum disorders (asd),” Evidence-Based Child Health: A Cochrane Review Journal, vol. 8, no. 6, pp. 2380–2479, 2013.
- J. Zeidan, E. Fombonne, J. Scorah, A. Ibrahim, M. S. Durkin, S. Saxena, A. Yusuf, A. Shih, and M. Elsabbagh, “Global prevalence of autism: a systematic review update,” Autism Research, vol. 15, no. 5, pp. 778–790, 2022.
- S. Rajagopalan, A. Dhall, and R. Goecke, “Self-stimulatory behaviours in the wild for autism diagnosis,” in Proceedings of the IEEE International Conference on Computer Vision Workshops, 2013, pp. 755–761.
- P. Washington, A. Kline, O. C. Mutlu, E. Leblanc, C. Hou, N. Stockham, K. Paskov, B. Chrisman, and D. Wall, “Activity recognition with moving cameras and few training examples: Applications for detection of autism-related headbanging,” in Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems, ser. CHI EA ’21. New York, NY, USA: Association for Computing Machinery, 2021. [Online]. Available: https://doi.org/10.1145/3411763.3451701
- A. Lakkapragada, A. Kline, O. Cezmi Mutlu, K. Paskov, B. Chrisman, N. Stockham, P. Washington, and D. Wall, “Classification of Abnormal Hand Movement for Aiding in Autism Detection: Machine Learning Study,” arXiv e-prints, p. arXiv:2108.07917, Aug. 2021.
- C.-H. Min, “Automatic detection and labeling of self-stimulatory behavioral patterns in children with autism spectrum disorder,” in 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, 2017, pp. 279–282.
- T. L. Munea, Y. Z. Jembre, H. T. Weldegebriel, L. Chen, C. Huang, and C. Yang, “The progress of human pose estimation: A survey and taxonomy of models applied in 2d human pose estimation,” IEEE Access, vol. 8, pp. 133 330–133 348, 2020.
- Tensorflow, “Movenet: Ultra fast and accurate pose detection model. tensorflow hub,” Dec 2022. [Online]. Available: https://www.tensorflow.org/hub/tutorials/movenet
- C.-Y. Wang, A. Bochkovskiy, and H.-Y. M. Liao, “Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors,” 2022.
- S. Liu and W. Deng, “Very deep convolutional neural network based image classification using small training sample size,” in 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR), Nov 2015, pp. 730–734.
- “die9origephit/children-vs-adults-images,” 2022. [Online]. Available: https://www.kaggle.com/datasets/die9origephit/children-vs-adults-images
- D. Silva, “Davidtvs/pytorch-lr-finder: A learning rate range test implementation in pytorch,” 2020. [Online]. Available: https://github.com/davidtvs/pytorch-lr-finder
- D. Tran, H. Wang, L. Torresani, J. Ray, Y. LeCun, and M. Paluri, “A closer look at spatiotemporal convolutions for action recognition,” in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 2018, pp. 6450–6459.
- V. Lialin, S. Rawls, D. Chan, S. Ghosh, A. Rumshisky, and W. Hamza, “Scalable and accurate self-supervised multimodal representation learning without aligned video and text data,” in 2023 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW). IEEE, jan 2023. [Online]. Available: https://doi.org/10.1109%2Fwacvw58289.2023.00043
- K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” 2015.
- R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, “Grad-CAM: Visual explanations from deep networks via gradient-based localization,” International Journal of Computer Vision, vol. 128, no. 2, pp. 336–359, oct 2019. [Online]. Available: https://doi.org/10.1007%2Fs11263-019-01228-7
- F. Chollet, “Xception: Deep learning with depthwise separable convolutions,” 2017.
- A. R. Abdali, “Data efficient video transformer for violence detection,” in 2021 IEEE International Conference on Communication, Networks and Satellite (COMNETSAT). IEEE, 2021, pp. 195–199.
- D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri, “Learning spatiotemporal features with 3d convolutional networks,” in Proceedings of the IEEE international conference on computer vision, 2015, pp. 4489–4497.
- L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool, “Temporal segment networks: Towards good practices for deep action recognition,” in European conference on computer vision. Springer, 2016, pp. 20–36.
- S. Chen and Q. Zhao, “Attention-based autism spectrum disorder screening with privileged modality,” 10 2019, pp. 1181–1190.
- G. Hinton, O. Vinyals, and J. Dean, “Distilling the knowledge in a neural network,” 2015, cite arxiv:1503.02531Comment: NIPS 2014 Deep Learning Workshop. [Online]. Available: http://arxiv.org/abs/1503.02531