TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/YouCook2

YouCook2

TextsVideosCustomIntroduced 2018-01-01

YouCook2 is the largest task-oriented, instructional video dataset in the vision community. It contains 2000 long untrimmed videos from 89 cooking recipes; on average, each distinct recipe has 22 videos. The procedure steps for each video are annotated with temporal boundaries and described by imperative English sentences (see the example below). The videos were downloaded from YouTube and are all in the third-person viewpoint. All the videos are unconstrained and can be performed by individual persons at their houses with unfixed cameras. YouCook2 contains rich recipe types and various cooking styles from all over the world.

Source: http://youcook2.eecs.umich.edu/ Image Source: https://competitions.codalab.org/competitions/20594

Benchmarks

Dense Video Captioning/CIDErDense Video Captioning/METEORDense Video Captioning/SODADense Video Captioning/BLEU4Dense Video Captioning/ROUGE-LDense Video Captioning/F1Dense Video Captioning/PrecisionDense Video Captioning/RecallLong Video Retrieval (Background Removed)/Cap. Avg. R@1Long Video Retrieval (Background Removed)/Cap. Avg. R@5Long Video Retrieval (Background Removed)/Cap. Avg. R@10Long Video Retrieval (Background Removed)/DTW R@1Long Video Retrieval (Background Removed)/DTW R@5Long Video Retrieval (Background Removed)/DTW R@10Long Video Retrieval (Background Removed)/OTAM R@1Long Video Retrieval (Background Removed)/OTAM R@5Long Video Retrieval (Background Removed)/OTAM R@10Video/text-to-video R@1Video/text-to-video R@5Video/text-to-video R@10Video/text-to-video Median RankVideo/text-to-video Mean RankVideo/Object Top 5 AccuracyVideo/Object Top-1 AccuracyVideo/Verb Top-1 AccuracyVideo/Verb Top-5 AccuracyVideo Captioning/BLEU-4Video Captioning/BLEU-3Video Captioning/CIDErVideo Captioning/ROUGE-LVideo Captioning/METEORVideo Captioning/SODAVideo Captioning/BLEU4Video Captioning/F1Video Captioning/PrecisionVideo Captioning/RecallVideo Retrieval/text-to-video R@1Video Retrieval/text-to-video R@5Video Retrieval/text-to-video R@10Video Retrieval/text-to-video Median RankVideo Retrieval/text-to-video Mean RankZero-Shot Video Retrieval/text-to-video R@1Zero-Shot Video Retrieval/text-to-video R@5Zero-Shot Video Retrieval/text-to-video R@10Zero-Shot Video Retrieval/text-to-video Mean RankZero-Shot Video Retrieval/text-to-video Median Rank

Statistics

Papers
198
Benchmarks
46

Links

Homepage

Tasks

Action ClassificationDense Video CaptioningLong Video Retrieval (Background Removed)VideoVideo CaptioningVideo RetrievalZero-Shot Video RetrievalZero-Shot Video-Audio RetrievalZero-shot dense video captioning