Detection of Object Throwing Behavior in Surveillance Videos (2403.06552v1)
Abstract: Anomalous behavior detection is a challenging research area within computer vision. Progress in this area enables automated detection of dangerous behavior using surveillance camera feeds. A dangerous behavior that is often overlooked in other research is the throwing action in traffic flow, which is one of the unique requirements of our Smart City project to enhance public safety. This paper proposes a solution for throwing action detection in surveillance videos using deep learning. At present, datasets for throwing actions are not publicly available. To address the use-case of our Smart City project, we first generate the novel public 'Throwing Action' dataset, consisting of 271 videos of throwing actions performed by traffic participants, such as pedestrians, bicyclists, and car drivers, and 130 normal videos without throwing actions. Second, we compare the performance of different feature extractors for our anomaly detection method on the UCF-Crime and Throwing-Action datasets. The explored feature extractors are the Convolutional 3D (C3D) network, the Inflated 3D ConvNet (I3D) network, and the Multi-Fiber Network (MFNet). Finally, the performance of the anomaly detection algorithm is improved by applying the Adam optimizer instead of Adadelta, and proposing a mean normal loss function that covers the multitude of normal situations in traffic. Both aspects yield better anomaly detection performance. Besides this, the proposed mean normal loss function lowers the false alarm rate on the combined dataset. The experimental results reach an area under the ROC curve of 86.10 for the Throwing-Action dataset, and 80.13 on the combined dataset, respectively.
- J. Carreira and A. Zisserman, “Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset,” in CVPR, 2017, pp. 6299–6308.
- Y. Chang, Z. Tu, W. Xie, and J. Yuan, “Clustering Driven Deep Autoencoder for Video Anomaly Detection,” in Computer Vision – ECCV 2020. ECCV 2020. Lecture Notes in Computer Science, vol. 12360 LNCS. Springer, Cham, 8 2020, pp. 329–345. [Online]. Available: https://link.springer.com/chapter/10.1007/978-3-030-58555-6_20
- Y. Chen, Y. Kalantidis, J. Li, S. Yan, and J. Feng, “Multi-Fiber Networks for Video Recognition,” in ECCV, 2018, pp. 352–367.
- R. Csordás, L. Havasi, and T. Szirányi, “Detecting objects thrown over fence in outdoor scenes,” in Proc. Int. Conf. Comput. Vision Theory Applicat. (VISAPP), Berlin, 2015, pp. 593–599. [Online]. Available: http://eprints.sztaki.hu/8636/1/Csordas_593_2856278_ny.pdf
- Z. Dai and Z. Zheng, “A YOLOv3-Based Learning Strategy for Vehicle-Thrown-Waste Identification,” in ICIC 2021: Intelligent Computing Theories and Application. Springer, Cham, 8 2021, pp. 305–315. [Online]. Available: https://link.springer.com/chapter/10.1007/978-3-030-84529-2_26
- K. Deepak, S. Chandrakala, and C. K. Mohan, “Residual spatiotemporal autoencoder for unsupervised video anomaly detection,” Signal, Image and Video Processing, vol. 15, no. 1, pp. 215–222, 7 2020. [Online]. Available: https://link.springer.com/article/10.1007/s11760-020-01740-1
- S. Ioffe and C. Szegedy, “Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift,” in Proceedings of the 32nd International Conference on Machine Learning, PMLR, 2015, pp. 448–456.
- A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and F. F. Li, “Large-scale video classification with convolutional neural networks,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society, 9 2014, pp. 1725–1732. [Online]. Available: https://paperswithcode.com/paper/large-scale-video-classification-with-1
- E. Kosman, “Pytorch implementation of Real-World Anomaly Detection in Surveillance Videos.” [Online]. Available: https://github.com/ekosman/AnomalyDetectionCVPR2018-Pytorch
- Z. Li, Y. Li, and Z. Gao, “Spatiotemporal Representation Learning for Video Anomaly Detection,” IEEE Access, vol. 8, pp. 25 531–25 542, 2020.
- K. Liu and H. Ma, “Exploring background-bias for anomaly detection in surveillance videos,” in MM 2019 - Proceedings of the 27th ACM International Conference on Multimedia. Association for Computing Machinery, Inc, 10 2019, pp. 1490–1499. [Online]. Available: https://doi.org/10.1145/3343031.3350998
- H. Lv, C. Zhou, Z. Cui, C. Xu, Y. Li, and J. Yang, “Localizing Anomalies from Weakly-Labeled Videos,” IEEE Transactions on Image Processing, vol. 30, pp. 4505–4515, 2021.
- N. Nasaruddin, K. Muchtar, A. Afdhal, and A. P. J. Dwiyantoro, “Deep anomaly detection through visual attention in surveillance videos,” Journal of Big Data, vol. 7, no. 1, pp. 1–17, 10 2020. [Online]. Available: https://journalofbigdata.springeropen.com/articles/10.1186/s40537-020-00365-y
- R. Nawaratne, D. Alahakoon, D. De Silva, and X. Yu, “Spatiotemporal anomaly detection using deep learning for real-time video surveillance,” IEEE Transactions on Industrial Informatics, vol. 16, no. 1, pp. 393–402, 1 2020.
- T.-N. Nguyen and J. Meunier, “Anomaly Detection in Video Sequence With Appearance-Motion Correspondence,” in Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, 2019, pp. 1273–1283.
- S. Petrocchi, G. Giorgi, and M. G. Cimino, “A Real-Time Deep Learning Approach for Real-World Video Anomaly Detection,” in ARES 2021: The 16th International Conference on Availability, Reliability and Security. Association for Computing Machinery, 8 2021, pp. 1–9.
- J. Redmon and A. Farhadi, “YOLOv3: An Incremental Improvement,” arXiv, 4 2018. [Online]. Available: https://arxiv.org/abs/1804.02767v1
- E. Ribnick, S. Atev, N. Papanikolopoulos, O. Masoud, and R. Voyles, “Detection of thrown objects in indoor and outdoor scenes,” in IEEE International Conference on Intelligent Robots and Systems, 2007, pp. 979–984.
- M. Sabokrou, M. Fayyaz, M. Fathy, Z. Moayed, and R. Klette, “Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes,” Computer Vision and Image Understanding, vol. 172, pp. 88–97, 7 2018.
- G. A. Sigurdsson, G. Varol, X. Wang, A. Farhadi, I. Laptev, and A. Gupta, “Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding,” in ECCV, vol. 9905 LNCS. Springer, Cham, 2016, pp. 510–526. [Online]. Available: https://link.springer.com/chapter/10.1007/978-3-319-46448-0_31
- P. Singh and V. Pankajakshan, “A Deep Learning Based Technique for Anomaly Detection in Surveillance Videos,” in 2018 24th National Conference on Communications, NCC 2018. Institute of Electrical and Electronics Engineers Inc., 1 2019.
- K. Soomro, A. R. Zamir, and M. Shah, “UCF101: A dataset of 101 human actions classes from videos in the wild,” CoRR, vol. abs/1212.0402, 2012. [Online]. Available: http://arxiv.org/abs/1212.0402
- W. Sultani, C. Chen, and M. Shah, “Real-World Anomaly Detection in Surveillance Videos,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2018, pp. 6479–6488. [Online]. Available: http://crcv.ucf.edu/projects/real-world/
- Y. Tian, G. Pang, Y. Chen, R. Singh, J. W. Verjans, and G. Carneiro, “Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning,” in International Conference for Computer Vision, 2021.
- D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri, “Learning Spatiotemporal Features With 3D Convolutional Networks,” in ICCV, 2015, pp. 4489–4497.
- W. Ullah, A. Ullah, I. U. Haq, K. Muhammad, M. Sajjad, and S. W. Baik, “CNN features with bi-directional LSTM for real-time anomaly detection in surveillance networks,” Multimedia Tools and Applications, vol. 80, no. 11, pp. 16 979–16 995, 8 2020. [Online]. Available: https://link.springer.com/article/10.1007/s11042-020-09406-3
- J. T. Zhou, J. Du, H. Zhu, X. Peng, Y. Liu, and R. S. M. Goh, “AnomalyNet: An Anomaly Detection Network for Video Surveillance,” IEEE Transactions on Information Forensics and Security, vol. 14, no. 10, pp. 2537–2550, 10 2019.