Street Scene: A New Dataset and Evaluation Protocol for Video Anomaly Detection
The paper "Street Scene: A New Dataset and Evaluation Protocol for Video Anomaly Detection" introduces an innovative approach to video anomaly detection research, an area of growing significance given the widespread use of surveillance systems. The authors, Bharathkumar Ramachandra and Michael J. Jones, present a large, varied dataset and novel evaluation criteria that address existing limitations in the field. This work has the potential to enhance the realism and applicability of anomaly detection methods.
Advancements in Dataset and Evaluation Criteria
The "Street Scene" dataset is substantial in scope, addressing the limited variety and volume of existing datasets that have hindered progress in video anomaly detection research. With over 203,000 frames containing 205 anomalous events of 17 types, the dataset offers a diverse set of scenarios recorded in a realistic urban setting. This surpasses previously available datasets such as UCSD Pedestrian and CUHK Avenue, which have been criticized for their modest size and lack of scene complexity. Notably, the Street Scene dataset provides detailed annotations with spatio-temporal labels and track numbers for anomalous events, enabling more precise evaluations.
The authors also propose new evaluation criteria that more accurately reflect the operational capabilities of anomaly detection algorithms. Traditional frame-level and pixel-level measures have been criticized for inadequacies: they do not account for spatial localization effectively, and they cannot register multiple false positives within the same frame. To address this, the authors introduce track-based and region-based detection criteria, which emphasize spatio-temporal localization and better quantify true-positive and false-positive instances.
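To make the region-based idea concrete, the sketch below counts per-frame true and false positives by overlap between detected and ground-truth regions. This is a simplified illustration, not the paper's exact protocol: the box format, the `region_based_counts` helper, and the IoU threshold `beta` are assumptions chosen for clarity.

```python
import numpy as np

def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2)."""
    x1 = max(box_a[0], box_b[0]); y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2]); y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / float(area_a + area_b - inter)

def region_based_counts(gt_regions, det_regions, beta=0.1):
    """Per-frame counts in the spirit of a region-based criterion:
    a ground-truth region is a true positive if some detection overlaps
    it with IoU >= beta; a detection that overlaps no ground-truth
    region counts as a false positive. Note that, unlike a frame-level
    measure, several false positives can accrue in a single frame."""
    tp = sum(1 for g in gt_regions
             if any(iou(g, d) >= beta for d in det_regions))
    fp = sum(1 for d in det_regions
             if not any(iou(d, g) >= beta for g in gt_regions))
    return tp, fp
```

Aggregating these counts over all frames at varying detection thresholds would yield the kind of detection-rate versus false-positive-rate curve the new criteria are designed to support.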
Evaluation of Baseline Algorithms
The paper evaluates two variations of a novel baseline algorithm on both established datasets and the new Street Scene dataset. The baseline methods, which apply a nearest-neighbor approach to video patches represented by blurred foreground masks or optical flow features, set the initial performance benchmarks on the challenging Street Scene dataset. Compared with existing state-of-the-art algorithms such as those by Lu et al. and Hasan et al., the baseline methods demonstrate superior performance.
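The nearest-neighbor idea behind the baselines can be sketched briefly: patches observed during training serve as exemplars of normal activity, and a test patch is scored by its distance to the closest exemplar from the same spatial region. The function name and the use of L2 distance on flattened feature vectors are assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

def nn_anomaly_score(test_patch, exemplars):
    """Score a spatio-temporal patch by the L2 distance to its nearest
    neighbor among exemplars of normal activity collected (in training)
    for the same spatial region. A larger distance suggests the patch
    is less like anything seen during training, i.e. more anomalous."""
    dists = np.linalg.norm(exemplars - test_patch, axis=1)
    return float(dists.min())

# Usage: exemplars and test patches would be blurred-foreground-mask
# or optical-flow features, flattened to vectors.
normal_exemplars = np.array([[0.0, 0.0], [1.0, 1.0]])
print(nn_anomaly_score(np.array([0.0, 0.0]), normal_exemplars))  # familiar patch
print(nn_anomaly_score(np.array([5.0, 5.0]), normal_exemplars))  # unfamiliar patch
```

Maintaining a separate exemplar set per spatial region lets the model encode location-specific normality, e.g. that vehicles are normal on the road but anomalous on the sidewalk.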
While the baseline methods excel at detecting distinct motion anomalies such as jaywalking or illegal vehicle maneuvers, static anomalies like loitering remain challenging to detect. The foreground-mask and flow-based variants each showed distinct strengths and limitations across anomaly types and evaluation measures, a nuance essential for tailoring practical anomaly detection tools.
Implications and Future Directions
This dataset and the proposed evaluation framework bring a new level of realism and rigor to video anomaly detection, with significant implications for surveillance and security domains. By enabling precise anomaly localization and providing a more comprehensive dataset, they allow algorithms to be trained and evaluated more rigorously for real-world applications.
Future developments can aim to refine anomaly detection techniques to handle static anomalies more effectively, potentially integrating contextual scene understanding. The use of advanced machine learning techniques including deep learning or hybrid models may further enhance detection accuracy and operational efficiency.
Moreover, the dataset opens avenues for benchmarking a wide array of techniques, encouraging a community-driven effort to raise the standards of anomaly detection research. Researchers can build on this work to improve generalization across variable environmental conditions and the adaptability of surveillance systems.
In conclusion, "Street Scene" marks a pivotal contribution to video anomaly detection, aligning research methodology with real-world requirements and pushing forward the potential for practical and accurate surveillance technology deployment.