Laura Kart, Niv Cohen
In recent years, many works have addressed the problem of finding never-seen-before anomalies in videos. Yet, most work has been focused on detecting anomalous frames in surveillance videos taken from security cameras. Meanwhile, the task of anomaly detection (AD) in videos exhibiting anomalous mechanical behavior, has been mostly overlooked. Anomaly detection in such videos is both of academic and practical interest, as they may enable automatic detection of malfunctions in many manufacturing, maintenance, and real-life settings. To assess the potential of the different approaches to detect such anomalies, we evaluate two simple baseline approaches: (i) Temporal-pooled image AD techniques. (ii) Density estimation of videos represented with features pretrained for video-classification. Development of such methods calls for new benchmarks to allow evaluation of different possible approaches. We introduce the Physical Anomalous Trajectory or Motion (PHANTOM) dataset, which contains six different video classes. Each class consists of normal and anomalous videos. The classes differ in the presented phenomena, the normal class variability, and the kind of anomalies in the videos. We also suggest an even harder benchmark where anomalous activities should be spotted on highly variable scenes.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Anomaly Detection | PHANTOM | Avg. ROC-AUC | 0.78 | Pooled Image Level kNN |
| Anomaly Detection | PHANTOM | Avg. ROC-AUC | 0.76 | Video Level features kNN |
| Anomaly Detection | Something-Something V2 | Avg. ROC-AUC | 0.58 | Pooled Image Level kNN |
| Anomaly Detection | Something-Something V2 | Avg. ROC-AUC | 0.52 | Video Level features kNN |
| Abnormal Event Detection In Video | PHANTOM | Avg. ROC-AUC | 0.78 | Pooled Image Level kNN |
| Abnormal Event Detection In Video | PHANTOM | Avg. ROC-AUC | 0.76 | Video Level features kNN |
| Abnormal Event Detection In Video | Something-Something V2 | Avg. ROC-AUC | 0.58 | Pooled Image Level kNN |
| Abnormal Event Detection In Video | Something-Something V2 | Avg. ROC-AUC | 0.52 | Video Level features kNN |
| Semi-supervised Anomaly Detection | PHANTOM | Avg. ROC-AUC | 0.78 | Pooled Image Level kNN |
| Semi-supervised Anomaly Detection | PHANTOM | Avg. ROC-AUC | 0.76 | Video Level features kNN |
| Semi-supervised Anomaly Detection | Something-Something V2 | Avg. ROC-AUC | 0.58 | Pooled Image Level kNN |
| Semi-supervised Anomaly Detection | Something-Something V2 | Avg. ROC-AUC | 0.52 | Video Level features kNN |