TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MTFL: Multi-Timescale Feature Learning for Weakly-Supervis...

MTFL: Multi-Timescale Feature Learning for Weakly-Supervised Anomaly Detection in Surveillance Videos

Yiling Zhang, Erkut Akdag, Egor Bondarev, Peter H. N. de With

2024-10-08Anomaly Detection In Surveillance VideosVideo Anomaly DetectionAnomaly DetectionSupervised Anomaly DetectionWeakly-supervised Anomaly Detection
PaperPDFCode(official)

Abstract

Detection of anomaly events is relevant for public safety and requires a combination of fine-grained motion information and contextual events at variable time-scales. To this end, we propose a Multi-Timescale Feature Learning (MTFL) method to enhance the representation of anomaly features. Short, medium, and long temporal tubelets are employed to extract spatio-temporal video features using a Video Swin Transformer. Experimental results demonstrate that MTFL outperforms state-of-the-art methods on the UCF-Crime dataset, achieving an anomaly detection performance 89.78% AUC. Moreover, it performs complementary to SotA with 95.32% AUC on the ShanghaiTech and 84.57% AP on the XD-Violence dataset. Furthermore, we generate an extended dataset of the UCF-Crime for development and evaluation on a wider range of anomalies, namely Video Anomaly Detection Dataset (VADD), involving 2,591 videos in 18 classes with extensive coverage of realistic anomalies.

Results

TaskDatasetMetricValueModel
Video UnderstandingVADDROC AUC88.42MTFL (VST, finetuned on VADD)
Video UnderstandingShanghaiTech Weakly SupervisedAUC-ROC95.7MTFL (VST, finetuned on VADD)
Video UnderstandingShanghaiTech Weakly SupervisedAUC-ROC95.32MTFL (VST)
Video UnderstandingUCF-CrimeROC AUC89.78MTFL (VST, finetuned on VADD)
Video UnderstandingUCF-CrimeROC AUC87.16MTFL (VST)
Video UnderstandingXD-ViolenceAP84.57MTFL (VST)
Video UnderstandingXD-ViolenceAP79.4MTFL (VST, finetuned on VADD)
VideoVADDROC AUC88.42MTFL (VST, finetuned on VADD)
VideoShanghaiTech Weakly SupervisedAUC-ROC95.7MTFL (VST, finetuned on VADD)
VideoShanghaiTech Weakly SupervisedAUC-ROC95.32MTFL (VST)
VideoUCF-CrimeROC AUC89.78MTFL (VST, finetuned on VADD)
VideoUCF-CrimeROC AUC87.16MTFL (VST)
VideoXD-ViolenceAP84.57MTFL (VST)
VideoXD-ViolenceAP79.4MTFL (VST, finetuned on VADD)
Anomaly DetectionVADDROC AUC88.42MTFL (VST, finetuned on VADD)
Anomaly DetectionShanghaiTech Weakly SupervisedAUC-ROC95.7MTFL (VST, finetuned on VADD)
Anomaly DetectionShanghaiTech Weakly SupervisedAUC-ROC95.32MTFL (VST)
Anomaly DetectionUCF-CrimeROC AUC89.78MTFL (VST, finetuned on VADD)
Anomaly DetectionUCF-CrimeROC AUC87.16MTFL (VST)
Anomaly DetectionXD-ViolenceAP84.57MTFL (VST)
Anomaly DetectionXD-ViolenceAP79.4MTFL (VST, finetuned on VADD)

Related Papers

Multi-Stage Prompt Inference Attacks on Enterprise LLM Systems2025-07-213DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering2025-07-17A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17A Privacy-Preserving Framework for Advertising Personalization Incorporating Federated Learning and Differential Privacy2025-07-16Bridge Feature Matching and Cross-Modal Alignment with Mutual-filtering for Zero-shot Anomaly Detection2025-07-15Adversarial Activation Patching: A Framework for Detecting and Mitigating Emergent Deception in Safety-Aligned Transformers2025-07-12Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects2025-07-10seMCD: Sequentially implemented Monte Carlo depth computation with statistical guarantees2025-07-08