Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Assembly101

Videos | Introduced 2022-03-28

Assembly101 is a new procedural activity dataset featuring 4321 videos of people assembling and disassembling 101 "take-apart" toy vehicles. Participants work without fixed instructions, and the sequences feature rich and natural variations in action ordering, mistakes, and corrections. Assembly101 is the first multi-view action dataset, with simultaneous static (8) and egocentric (4) recordings. Sequences are annotated with more than 100K coarse and 1M fine-grained action segments, and 18M 3D hand poses. We benchmark on three action understanding tasks: recognition, anticipation and temporal segmentation. Additionally, we propose a novel task of detecting mistakes. The unique recording format and rich set of annotations allow us to investigate generalization to new toys, cross-view transfer, long-tailed distributions, and pose vs. appearance. We envision that Assembly101 will serve as a new challenge to investigate various activity understanding problems.

Image Source: https://assembly-101.github.io/

Benchmarks

2D Human Pose Estimation: Verbs Recall@5, Objects Recall@5, Actions Recall@5
3D Action Recognition: Actions Top-1, Verbs Top-1, Object Top-1
Action Anticipation: Verbs Recall@5, Objects Recall@5, Actions Recall@5
Action Localization: Actions Top-1, Verbs Top-1, Object Top-1, F1@10%, F1@25%, F1@50%, Edit, MoF
Action Recognition: Verbs Recall@5, Objects Recall@5, Actions Recall@5, Actions Top-1, Verbs Top-1, Object Top-1, HM
Action Recognition In Videos: Verbs Recall@5, Objects Recall@5, Actions Recall@5
Action Segmentation: F1@10%, F1@25%, F1@50%, Edit, MoF
Activity Recognition: Verbs Recall@5, Objects Recall@5, Actions Recall@5, Actions Top-1, Verbs Top-1, Object Top-1, HM
Temporal Action Localization: Actions Top-1, Verbs Top-1, Object Top-1
Video: Actions Top-1, Verbs Top-1, Object Top-1
Zero-Shot Learning: Actions Top-1, Verbs Top-1, Object Top-1

Statistics

Papers: 57
Benchmarks: 48

Tasks

2D Human Pose Estimation
3D Action Recognition
Action Anticipation
Action Localization
Action Recognition
Action Recognition In Videos
Action Segmentation
Activity Recognition
Mistake Detection
Open Vocabulary Action Recognition
Temporal Action Localization
Video
Zero-Shot Learning