TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/AViD Dataset: Anonymized Videos from Diverse Countries

AViD Dataset: Anonymized Videos from Diverse Countries

AJ Piergiovanni, Michael S. Ryoo

2020-07-10NeurIPS 2020 12Action DetectionAction ClassificationAction Recognition
PaperPDFCode(official)

Abstract

We introduce a new public video dataset for action recognition: Anonymized Videos from Diverse countries (AViD). Unlike existing public video datasets, AViD is a collection of action videos from many different countries. The motivation is to create a public dataset that would benefit training and pretraining of action recognition models for everybody, rather than making it useful for limited countries. Further, all the face identities in the AViD videos are properly anonymized to protect their privacy. It also is a static dataset where each video is licensed with the creative commons license. We confirm that most of the existing video datasets are statistically biased to only capture action videos from a limited number of countries. We experimentally illustrate that models trained with such biased datasets do not transfer perfectly to action videos from the other countries, and show that AViD addresses such problem. We also confirm that the new AViD dataset could serve as a good dataset for pretraining the models, performing comparably or better than prior datasets.

Results

TaskDatasetMetricValueModel
VideoAViDAccuracy50.9SlowFast-101 16x8
VideoAViDAccuracy50.5RepFlow ResNet-50
VideoAViDAccuracy50.4SlowFast-50 8x8
VideoAViDAccuracy50.1Two-Stream 3D ResNet-50
VideoAViDAccuracy48.8(2+1)D ResNet-50
VideoAViDAccuracy48.5SlowFast-50 4x4
VideoAViDAccuracy48.23D ResNet-50
VideoAViDAccuracy46.8I3D
VideoAViDAccuracy36.22D ResNet-50
Action DetectionCharadesmAP25.23D ResNet-50 + super-events pretrained on AViD
Action DetectionCharadesmAP23.23D ResNet-50 pretrained on AViD

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment2025-07-01EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception2025-06-26CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment2025-06-25MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans2025-06-25Feature Hallucination for Self-supervised Action Recognition2025-06-25CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition2025-06-25Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition2025-06-23