Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/Charades

Charades

VideosCustom (non-commercial)Introduced 2016-01-01

The Charades dataset is composed of 9,848 videos of daily indoors activities with an average length of 30 seconds, involving interactions with 46 objects classes in 15 types of indoor scenes and containing a vocabulary of 30 verbs leading to 157 action classes. Each video in this dataset is annotated by multiple free-text descriptions, action labels, action intervals and classes of interacting objects. 267 different users were presented with a sentence, which includes objects and actions from a fixed vocabulary, and they recorded a video acting out the sentence. In total, the dataset contains 66,500 temporal annotations for 157 action classes, 41,104 labels for 46 object classes, and 27,847 textual descriptions of the videos. In the standard split there are7,986 training video and 1,863 validation video.

Source: Temporal Reasoning Graph for Activity Recognition

Benchmarks

16k/MAP 2D Classification/MAP 2D Object Detection/MAP 3D/MAP Action Detection/mAP Action Recognition/MAP Activity Recognition/MAP Object Detection/MAP Video/MAP Video/FLOPs (G) x views Video/mAP Video Classification/mAP Zero-Shot Action Recognition/mAP

Related Benchmarks

Charades-Ego/Action Recognition/mAP Charades-Ego/Activity Recognition/mAP Charades-STA/10-shot image generation/Recall@Sum Charades-STA/Moment Retrieval/R@1 IoU=0.3 Charades-STA/Moment Retrieval/R@1 IoU=0.5 Charades-STA/Moment Retrieval/R@1 IoU=0.7 Charades-STA/Moment Retrieval/R@5 IoU=0.5 Charades-STA/Moment Retrieval/R@5 IoU=0.7 Charades-STA/Moment Retrieval/mIoU Charades-STA/Temporal Sentence Grounding/R1@0.5 Charades-STA/Temporal Sentence Grounding/R1@0.7 Charades-STA/Temporal Sentence Grounding/R5@0.5 Charades-STA/Temporal Sentence Grounding/R5@0.7 Charades-STA/Text to Video Retrieval/Recall@Sum Charades-STA/Video/R1@0.5 Charades-STA/Video/R1@0.7 Charades-STA/Video/R5@0.5 Charades-STA/Video/R5@0.7 Charades-STA/Video/text-to-video Mean Rank Charades-STA/Video/text-to-video Median Rank Charades-STA/Video/text-to-video R@1 Charades-STA/Video/text-to-video R@10 Charades-STA/Video/video-to-text Mean Rank Charades-STA/Video/video-to-text Median Rank Charades-STA/Video/video-to-text R@1 Charades-STA/Video/video-to-text R@10 Charades-STA/Video Retrieval/text-to-video Mean Rank Charades-STA/Video Retrieval/text-to-video Median Rank Charades-STA/Video Retrieval/text-to-video R@1 Charades-STA/Video Retrieval/text-to-video R@10 Charades-STA/Video Retrieval/video-to-text Mean Rank Charades-STA/Video Retrieval/video-to-text Median Rank Charades-STA/Video Retrieval/video-to-text R@1 Charades-STA/Video Retrieval/video-to-text R@10 Charades-STA/Video Understanding/R1@0.5 Charades-STA/Video Understanding/R1@0.7 Charades-STA/Video Understanding/R5@0.5 Charades-STA/Video Understanding/R5@0.7

Statistics

Papers: 428
Benchmarks: 13

Links

Tasks

16k 2D Classification 2D Object Detection 3D Action Classification Action Detection Action Recognition Activity Recognition Object Detection Temporal Action Localization Video Video Classification Video Understanding Weakly Supervised Object Detection Zero-Shot Action Recognition