TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/Charades

Charades

VideosCustom (non-commercial)Introduced 2016-01-01

The Charades dataset is composed of 9,848 videos of daily indoors activities with an average length of 30 seconds, involving interactions with 46 objects classes in 15 types of indoor scenes and containing a vocabulary of 30 verbs leading to 157 action classes. Each video in this dataset is annotated by multiple free-text descriptions, action labels, action intervals and classes of interacting objects. 267 different users were presented with a sentence, which includes objects and actions from a fixed vocabulary, and they recorded a video acting out the sentence. In total, the dataset contains 66,500 temporal annotations for 157 action classes, 41,104 labels for 46 object classes, and 27,847 textual descriptions of the videos. In the standard split there are7,986 training video and 1,863 validation video.

Source: Temporal Reasoning Graph for Activity Recognition

Benchmarks

16k/MAP2D Classification/MAP2D Object Detection/MAP3D/MAPAction Detection/mAPAction Recognition/MAPActivity Recognition/MAPObject Detection/MAPVideo/MAPVideo/FLOPs (G) x viewsVideo/mAPVideo Classification/mAPZero-Shot Action Recognition/mAP

Related Benchmarks

Charades-Ego/Action Recognition/mAPCharades-Ego/Activity Recognition/mAPCharades-STA/10-shot image generation/Recall@SumCharades-STA/Moment Retrieval/R@1 IoU=0.3Charades-STA/Moment Retrieval/R@1 IoU=0.5Charades-STA/Moment Retrieval/R@1 IoU=0.7Charades-STA/Moment Retrieval/R@5 IoU=0.5Charades-STA/Moment Retrieval/R@5 IoU=0.7Charades-STA/Moment Retrieval/mIoUCharades-STA/Temporal Sentence Grounding/R1@0.5Charades-STA/Temporal Sentence Grounding/R1@0.7Charades-STA/Temporal Sentence Grounding/R5@0.5Charades-STA/Temporal Sentence Grounding/R5@0.7Charades-STA/Text to Video Retrieval/Recall@SumCharades-STA/Video/R1@0.5Charades-STA/Video/R1@0.7Charades-STA/Video/R5@0.5Charades-STA/Video/R5@0.7Charades-STA/Video/text-to-video Mean RankCharades-STA/Video/text-to-video Median RankCharades-STA/Video/text-to-video R@1Charades-STA/Video/text-to-video R@10Charades-STA/Video/video-to-text Mean RankCharades-STA/Video/video-to-text Median RankCharades-STA/Video/video-to-text R@1Charades-STA/Video/video-to-text R@10Charades-STA/Video Retrieval/text-to-video Mean RankCharades-STA/Video Retrieval/text-to-video Median RankCharades-STA/Video Retrieval/text-to-video R@1Charades-STA/Video Retrieval/text-to-video R@10Charades-STA/Video Retrieval/video-to-text Mean RankCharades-STA/Video Retrieval/video-to-text Median RankCharades-STA/Video Retrieval/video-to-text R@1Charades-STA/Video Retrieval/video-to-text R@10Charades-STA/Video Understanding/R1@0.5Charades-STA/Video Understanding/R1@0.7Charades-STA/Video Understanding/R5@0.5Charades-STA/Video Understanding/R5@0.7

Statistics

Papers
428
Benchmarks
13

Links

Homepage

Tasks

16k2D Classification2D Object Detection3DAction ClassificationAction DetectionAction RecognitionActivity RecognitionObject DetectionTemporal Action LocalizationVideoVideo ClassificationVideo UnderstandingWeakly Supervised Object DetectionZero-Shot Action Recognition