TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/MSVD

MSVD

Microsoft Research Video Description Corpus

TextsVideosUnknownIntroduced 2011-01-01

The Microsoft Research Video Description Corpus (MSVD) dataset consists of about 120K sentences collected during the summer of 2010. Workers on Mechanical Turk were paid to watch a short video snippet and then summarize the action in a single sentence. The result is a set of roughly parallel descriptions of more than 2,000 video snippets. Because the workers were urged to complete the task in the language of their choice, both paraphrase and bilingual alternations are captured in the data.

Source: https://www.microsoft.com/en-us/download/details.aspx?id=52422&from=https%3A%2F%2Fresearch.microsoft.com%2Fen-us%2Fdownloads%2F38cf15fd-b8df-477e-a4e4-a4680caa75af%2F Image Source: https://arxiv.org/pdf/1609.06782.pdf

Benchmarks

Video/text-to-video R@1Video/text-to-video R@5Video/text-to-video R@10Video/text-to-video Median RankVideo/text-to-video Mean RankVideo/text-to-video R@50Video/video-to-text R@1Video/video-to-text R@5Video/video-to-text R@10Video/video-to-text Median RankVideo/video-to-text Mean RankVideo Captioning/CIDErVideo Captioning/BLEU-4Video Captioning/METEORVideo Captioning/ROUGE-LVideo Captioning/GSVideo Retrieval/text-to-video R@1Video Retrieval/text-to-video R@5Video Retrieval/text-to-video R@10Video Retrieval/text-to-video Median RankVideo Retrieval/text-to-video Mean RankVideo Retrieval/text-to-video R@50Video Retrieval/video-to-text R@1Video Retrieval/video-to-text R@5Video Retrieval/video-to-text R@10Video Retrieval/video-to-text Median RankVideo Retrieval/video-to-text Mean RankZero-Shot Video Retrieval/text-to-video R@1Zero-Shot Video Retrieval/text-to-video R@5Zero-Shot Video Retrieval/text-to-video R@10Zero-Shot Video Retrieval/text-to-video Median RankZero-Shot Video Retrieval/text-to-video Mean RankZero-Shot Video Retrieval/video-to-text R@1Zero-Shot Video Retrieval/video-to-text R@5Zero-Shot Video Retrieval/video-to-text R@10Zero-Shot Video Retrieval/video-to-text Median Rank

Related Benchmarks

MSVD-CTN/Video Captioning/CIDErMSVD-CTN/Video Captioning/ROUGE-LMSVD-CTN/Video Captioning/SPICEMSVD-Indonesian/10-shot image generation/Mean RankMSVD-Indonesian/10-shot image generation/Median RankMSVD-Indonesian/10-shot image generation/R@1MSVD-Indonesian/10-shot image generation/R@10MSVD-Indonesian/10-shot image generation/R@5MSVD-Indonesian/Text to Video Retrieval/Mean RankMSVD-Indonesian/Text to Video Retrieval/Median RankMSVD-Indonesian/Text to Video Retrieval/R@1MSVD-Indonesian/Text to Video Retrieval/R@10MSVD-Indonesian/Text to Video Retrieval/R@5MSVD-Indonesian/Video/text-to-video Mean RankMSVD-Indonesian/Video/text-to-video Median RankMSVD-Indonesian/Video/text-to-video R@1MSVD-Indonesian/Video/text-to-video R@10MSVD-Indonesian/Video/text-to-video R@5MSVD-Indonesian/Video/video-to-text Mean RankMSVD-Indonesian/Video/video-to-text Median RankMSVD-Indonesian/Video/video-to-text R@1MSVD-Indonesian/Video/video-to-text R@10MSVD-Indonesian/Video/video-to-text R@5MSVD-Indonesian/Video Captioning/BLEU-4MSVD-Indonesian/Video Captioning/CIDErMSVD-Indonesian/Video Captioning/METEORMSVD-Indonesian/Video Captioning/ROUGE-LMSVD-Indonesian/Video Retrieval/text-to-video Mean RankMSVD-Indonesian/Video Retrieval/text-to-video Median RankMSVD-Indonesian/Video Retrieval/text-to-video R@1MSVD-Indonesian/Video Retrieval/text-to-video R@10MSVD-Indonesian/Video Retrieval/text-to-video R@5MSVD-Indonesian/Video Retrieval/video-to-text Mean RankMSVD-Indonesian/Video Retrieval/video-to-text Median RankMSVD-Indonesian/Video Retrieval/video-to-text R@1MSVD-Indonesian/Video Retrieval/video-to-text R@10MSVD-Indonesian/Video Retrieval/video-to-text R@5MSVD-QA/Question Answering/AccuracyMSVD-QA/Question Answering/Confidence ScoreMSVD-QA/Video Question Answering/AccuracyMSVD-QA/Video Question Answering/Confidence ScoreMSVD-QA/Visual Question Answering/AccuracyMSVD-QA/Visual Question Answering (VQA)/AccuracyMSVD-QA/Zero-Shot Learning/Accuracy

Statistics

Papers
327
Benchmarks
36

Links

Homepage

Tasks

VideoVideo CaptioningVideo RetrievalZero-Shot Video Retrieval