Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection (2403.01169v2)

Published 2 Mar 2024 in cs.CV

Abstract: Most models for weakly supervised video anomaly detection (WS-VAD) rely on multiple instance learning, aiming to distinguish normal from abnormal snippets without specifying the type of anomaly. However, because the definition of an anomaly is ambiguous and context-dependent, such models may discriminate abnormal and normal events inaccurately. To show the model what is anomalous, a novel framework is proposed that guides the learning of suspected anomalies from event prompts. Given a textual prompt dictionary of potential anomaly events and the captions generated from anomaly videos, the semantic similarity between them is computed to identify the suspected events for each video snippet. This enables a new multi-prompt learning process that constrains visual-semantic features across all videos, and it also provides a new way to label pseudo anomalies for self-training. Comprehensive experiments and detailed ablation studies on four datasets (XD-Violence, UCF-Crime, TAD, and ShanghaiTech) demonstrate its effectiveness: the proposed model outperforms most state-of-the-art methods in AP or AUC (86.5%, 90.4%, 94.4%, and 97.4%, respectively). Furthermore, it shows promising performance in open-set and cross-dataset cases. The data, code, and models are available at: https://github.com/shiwoaz/lap
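The core matching step described in the abstract, scoring each snippet caption against a dictionary of anomaly-event prompts and thresholding the best score into a pseudo label, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the random vectors stand in for embeddings from a real text encoder, and the function names and threshold are illustrative assumptions.

```python
import numpy as np

def cosine_sim(a, b):
    # Row-wise cosine similarity: returns a matrix of shape (len(a), len(b)).
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a @ b.T

def suspected_anomalies(caption_emb, prompt_emb, threshold=0.5):
    """For each snippet caption, find its best-matching anomaly prompt.

    A snippet is pseudo-labeled anomalous (1) when its highest similarity
    to any prompt exceeds the threshold; otherwise it is labeled normal (0).
    The threshold value here is an illustrative choice, not from the paper.
    """
    sim = cosine_sim(caption_emb, prompt_emb)           # (snippets, prompts)
    best_prompt = sim.argmax(axis=1)                    # most similar event
    pseudo_label = (sim.max(axis=1) > threshold).astype(int)
    return best_prompt, pseudo_label

# Toy example: random vectors standing in for text-encoder embeddings.
rng = np.random.default_rng(0)
captions = rng.normal(size=(4, 8))   # embeddings of 4 snippet captions
prompts = rng.normal(size=(3, 8))    # embeddings of 3 anomaly-event prompts
idx, labels = suspected_anomalies(captions, prompts, threshold=0.0)
print(idx, labels)
```

In the paper's pipeline, the prompt-side and caption-side embeddings would come from a pretrained sentence encoder, and the resulting pseudo labels feed the self-training objective alongside the multi-prompt constraint.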

Authors (7)
  1. Chenchen Tao
  2. Chong Wang
  3. Xiaohao Peng
  4. Jiafei Wu
  5. Jiangbo Qian
  6. Puning Zhao
  7. Jun Wang