Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Weakly Supervised Detection of Baby Cry (2304.10001v3)

Published 19 Apr 2023 in cs.CV

Abstract: Detection of baby cries is an important part of baby monitoring and health care. Almost all existing methods use supervised SVM, CNN, or their varieties. In this work, we propose to use weakly supervised anomaly detection to detect a baby cry. In this weak supervision, we only need weak annotation if there is a cry in an audio file. We design a data mining technique using the pre-trained VGGish feature extractor and an anomaly detection network on long untrimmed audio files. The obtained datasets are used to train a simple CNN feature network for cry/non-cry classification. This CNN is then used as a feature extractor in an anomaly detection framework to achieve better cry detection performance.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. A large-scale benchmark dataset for anomaly detection and rare event classification for audio forensics. IEEE Access, 10:38885–38894, 2022.
  2. Blazeface: Sub-millisecond neural face detection on mobile gpus. In CVPR Workshops, 2019.
  3. A cnn-based method for infant cry detection and recognition. In Workshops of the International Conference on Advanced Information Networking and Applications, 2019.
  4. Baby Cry Detection: Deep Learning and Classical Approaches, pages 171–196. Springer International Publishing, Cham, 2020.
  5. B. Degardin and H. Proença. Human activity analysis: Iterative weak/self-supervised learning frameworks for detecting abnormal events. In 2020 IEEE International Joint Conference on Biometrics (IJCB), pages 1–7. IEEE, 2019.
  6. Analysis of lfcc feature extraction in baby crying classification using knn. In IEEE International Conference on Internet of Things and Intelligence System (IoTaIS), 2019.
  7. Mist: Multiple instance self-training framework for video anomaly detection. CVPR, 2021.
  8. Infant cry detection in adverse acoustic environments by using deep neural networks. In European Signal Processing Conference (EUSIPCO), 2018.
  9. Audio set: An ontology and human-labeled dataset for audio events. In Proc. IEEE ICASSP 2017, New Orleans, LA, 2017.
  10. Giulia. https://github.com/giulbia/baby_cry_detection.
  11. A set of dsp system to detect baby crying. In IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference, 2018.
  12. Cnn architectures for large-scale audio classification. In International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017.
  13. A review of infant cry analysis and classification. EURASIP Journal on Audio, Speech, and Music Processing, 2021(8), 2021.
  14. Description and discussion on dcase2020 challenge task2: Unsupervised anomalous sound detection for machine condition monitoring, 2020.
  15. Biomedical diagnosis of infant cry signal based on analysis of cepstrum by deep feedforward artificial neural networks. IEEE Instrumentation & Measurement Magazine, 24(2):24–29, 2021.
  16. Baby cry detection in domestic environment using deep learning. In IEEE International Conference on the Science of Electrical Engineering, 2016.
  17. Self-training multi-sequence learning with transformer for weakly supervised video anomaly detection. AAAI, 2022.
  18. Rare sound event detection using 1d convolutional recurrent neural networks. In Detection and Classification of Acoustic Scenes and Events 2017 Workshop, 2017.
  19. Localizing anomalies from weakly-labeled videos. IEEE Transactions on Image Processing (TIP), 2021.
  20. Deep learning based effective baby crying recognition method under indoor background sound environments. In International Conference on Computational Systems and Information Technology for Sustainable Solution (CSITSS), 2019.
  21. K. J. Piczak. ESC: Dataset for Environmental Sound Classification. In Proceedings of the 23rd Annual ACM Conference on Multimedia, pages 1015–1018. ACM Press.
  22. Real-world anomaly detection in surveillance videos. In CVPR, 2018.
  23. Weakly-supervised video anomaly detection with robust temporal feature magnitude learning. arXiv preprint arXiv:2101.10030, 2021.
  24. Baby cry sound detection: A comparison of hand crafted features and deep learning approach. In G. Boracchi, L. Iliadis, C. Jayne, and A. Likas, editors, Engineering Applications of Neural Networks, pages 168–179, Cham, 2017. Springer International Publishing.
  25. G. Veres. https://github.com/gveres/donateacry-corpus.
  26. Weakly-supervised spatio-temporal anomaly detection in surveillance video. IJCAI, 2021.
  27. P. Wu and J. Liu. Learning causal temporal relation and feature discrimination for anomaly detection. IEEE Transactions on Image Processing, 30:3513–3527, 2021.
  28. Infant crying detection in real-world environments. In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 131–135, 2022.
  29. Graph convolutional label noise cleaner: Train a plug-and-play action classifier for anomaly detection. In CVPR, 2019.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Weijun Tan (11 papers)
  2. Qi Yao (39 papers)
  3. Jingfeng Liu (18 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.