Tasks
SotA
Datasets
Papers
Methods
Submit
About
Datasets
/
MSR
MSR
Benchmarks
Chinese
/
F1
Chinese
/
Precision
Chinese
/
Recall
Related Benchmarks
MSR Action3D
/
3D Action Recognition
/
Accuracy
MSR Action3D
/
Action Detection
/
Accuracy
MSR Action3D
/
Action Localization
/
Accuracy
MSR Action3D
/
Action Recognition
/
Accuracy
MSR Action3D
/
Activity Recognition
/
Accuracy
MSR Action3D
/
Temporal Action Localization
/
Accuracy
MSR Action3D
/
Video
/
Accuracy
MSR Action3D
/
Zero-Shot Learning
/
Accuracy
MSR ActionPairs
/
3D Action Recognition
/
Accuracy
MSR ActionPairs
/
Action Detection
/
Accuracy
MSR ActionPairs
/
Action Localization
/
Accuracy
MSR ActionPairs
/
Action Recognition
/
Accuracy
MSR ActionPairs
/
Activity Recognition
/
Accuracy
MSR ActionPairs
/
Temporal Action Localization
/
Accuracy
MSR ActionPairs
/
Video
/
Accuracy
MSR ActionPairs
/
Zero-Shot Learning
/
Accuracy
MSR Daily Activity3D dataset
/
Activity Recognition
/
Accuracy
MSR-VTT
/
10-shot image generation
/
text-to-video R@1
MSR-VTT
/
Text to Video Retrieval
/
text-to-video R@1
MSR-VTT
/
Text-to-Video Generation
/
CLIP-FID
MSR-VTT
/
Text-to-Video Generation
/
CLIPSIM
MSR-VTT
/
Text-to-Video Generation
/
FID
MSR-VTT
/
Text-to-Video Generation
/
FVD
MSR-VTT
/
Video
/
FVD16
MSR-VTT
/
Video
/
Inception score
MSR-VTT
/
Video
/
text-to-video Mean Rank
MSR-VTT
/
Video
/
text-to-video Median Rank
MSR-VTT
/
Video
/
text-to-video MedianR
MSR-VTT
/
Video
/
text-to-video R@1
MSR-VTT
/
Video
/
text-to-video R@10
MSR-VTT
/
Video
/
text-to-video R@5
MSR-VTT
/
Video
/
text-to-videoMedian Rank
MSR-VTT
/
Video
/
video-to-text Mean Rank
MSR-VTT
/
Video
/
video-to-text Median Rank
MSR-VTT
/
Video
/
video-to-text R@1
MSR-VTT
/
Video
/
video-to-text R@10
MSR-VTT
/
Video
/
video-to-text R@5
MSR-VTT
/
Video Captioning
/
BLEU-4
MSR-VTT
/
Video Captioning
/
CIDEr
MSR-VTT
/
Video Captioning
/
GS
MSR-VTT
/
Video Captioning
/
METEOR
MSR-VTT
/
Video Captioning
/
ROUGE-L
MSR-VTT
/
Video Generation
/
FVD16
MSR-VTT
/
Video Generation
/
Inception score
MSR-VTT
/
Video Question Answering
/
Accuracy
MSR-VTT
/
Video Retrieval
/
text-to-video Mean Rank
MSR-VTT
/
Video Retrieval
/
text-to-video Median Rank
MSR-VTT
/
Video Retrieval
/
text-to-video MedianR
MSR-VTT
/
Video Retrieval
/
text-to-video R@1
MSR-VTT
/
Video Retrieval
/
text-to-video R@10
MSR-VTT
/
Video Retrieval
/
text-to-video R@5
MSR-VTT
/
Video Retrieval
/
text-to-videoMedian Rank
MSR-VTT
/
Video Retrieval
/
video-to-text Mean Rank
MSR-VTT
/
Video Retrieval
/
video-to-text Median Rank
MSR-VTT
/
Video Retrieval
/
video-to-text R@1
MSR-VTT
/
Video Retrieval
/
video-to-text R@10
MSR-VTT
/
Video Retrieval
/
video-to-text R@5
MSR-VTT
/
Zero-Shot Video Retrieval
/
text-to-video Mean Rank
MSR-VTT
/
Zero-Shot Video Retrieval
/
text-to-video Median Rank
MSR-VTT
/
Zero-Shot Video Retrieval
/
text-to-video R@1
MSR-VTT
/
Zero-Shot Video Retrieval
/
text-to-video R@10
MSR-VTT
/
Zero-Shot Video Retrieval
/
text-to-video R@5
MSR-VTT
/
Zero-Shot Video Retrieval
/
video-to-text Median Rank
MSR-VTT
/
Zero-Shot Video Retrieval
/
video-to-text R@1
MSR-VTT
/
Zero-Shot Video Retrieval
/
video-to-text R@10
MSR-VTT
/
Zero-Shot Video Retrieval
/
video-to-text R@5
MSR-VTT Adverbs
/
Video
/
Acc-A
MSR-VTT Adverbs
/
Video
/
mAP M
MSR-VTT Adverbs
/
Video
/
mAP W
MSR-VTT Adverbs
/
Video Retrieval
/
Acc-A
MSR-VTT Adverbs
/
Video Retrieval
/
mAP M
MSR-VTT Adverbs
/
Video Retrieval
/
mAP W
MSR-VTT Adverbs
/
Video-Adverb Retrieval
/
Acc-A
MSR-VTT Adverbs
/
Video-Adverb Retrieval
/
mAP M
MSR-VTT Adverbs
/
Video-Adverb Retrieval
/
mAP W
MSR-VTT-1kA
/
Video
/
text-to-video Mean Rank
MSR-VTT-1kA
/
Video
/
text-to-video Median Rank
MSR-VTT-1kA
/
Video
/
text-to-video R@1
MSR-VTT-1kA
/
Video
/
text-to-video R@10
MSR-VTT-1kA
/
Video
/
text-to-video R@5
MSR-VTT-1kA
/
Video
/
video-to-text Mean Rank
MSR-VTT-1kA
/
Video
/
video-to-text Median Rank
MSR-VTT-1kA
/
Video
/
video-to-text R@1
MSR-VTT-1kA
/
Video
/
video-to-text R@10
MSR-VTT-1kA
/
Video
/
video-to-text R@5
MSR-VTT-1kA
/
Video Retrieval
/
text-to-video Mean Rank
MSR-VTT-1kA
/
Video Retrieval
/
text-to-video Median Rank
MSR-VTT-1kA
/
Video Retrieval
/
text-to-video R@1
MSR-VTT-1kA
/
Video Retrieval
/
text-to-video R@10
MSR-VTT-1kA
/
Video Retrieval
/
text-to-video R@5
MSR-VTT-1kA
/
Video Retrieval
/
video-to-text Mean Rank
MSR-VTT-1kA
/
Video Retrieval
/
video-to-text Median Rank
MSR-VTT-1kA
/
Video Retrieval
/
video-to-text R@1
MSR-VTT-1kA
/
Video Retrieval
/
video-to-text R@10
MSR-VTT-1kA
/
Video Retrieval
/
video-to-text R@5
MSR-VTT-MC
/
Video Question Answering
/
Accuracy
MSR-VTT-full
/
Zero-Shot Video Retrieval
/
text-to-video R@1
MSR-VTT-full
/
Zero-Shot Video Retrieval
/
text-to-video R@10
MSR-VTT-full
/
Zero-Shot Video Retrieval
/
text-to-video R@5
MSR-VTT-full
/
Zero-Shot Video Retrieval
/
video-to-text R@1
MSR-VTT-full
/
Zero-Shot Video Retrieval
/
video-to-text R@10
MSR-VTT-full
/
Zero-Shot Video Retrieval
/
video-to-text R@5
MSRA
/
Chinese
/
F1
MSRA
/
Cross-Lingual
/
F1
MSRA
/
Cross-Lingual Transfer
/
F1
MSRA
/
Named Entity Recognition (NER)
/
F1
MSRA
/
Named Entity Recognition (NER)
/
Precision
MSRA
/
Named Entity Recognition (NER)
/
Recall
MSRA Dev
/
Named Entity Recognition (NER)
/
F1
MSRA Hands
/
1 Image, 2*2 Stitchi
/
Average 3D Error
MSRA Hands
/
3D
/
Average 3D Error
MSRA Hands
/
Hand
/
Average 3D Error
MSRA Hands
/
Hand Pose Estimation
/
Average 3D Error
MSRA Hands
/
Pose Estimation
/
Average 3D Error
MSRA-TD500
/
Scene Text Detection
/
F-Measure
MSRA-TD500
/
Scene Text Detection
/
FPS
MSRA-TD500
/
Scene Text Detection
/
Precision
MSRA-TD500
/
Scene Text Detection
/
Recall
MSRC-12
/
3D Action Recognition
/
Accuracy
MSRC-12
/
Action Detection
/
Accuracy
MSRC-12
/
Action Localization
/
Accuracy
MSRC-12
/
Action Recognition
/
Accuracy
MSRC-12
/
Activity Recognition
/
Accuracy
MSRC-12
/
Gesture Recognition
/
Accuracy
MSRC-12
/
Temporal Action Localization
/
Accuracy
MSRC-12
/
Video
/
Accuracy
MSRC-12
/
Zero-Shot Learning
/
Accuracy
MSRC-21 (per-class)
/
Classification
/
Accuracy (10 fold)
MSRC-21 (per-class)
/
Graph Classification
/
Accuracy (10 fold)
MSRP
/
Paraphrase Identification
/
Accuracy
MSRP
/
Paraphrase Identification
/
F1
MSRP
/
Semantic Textual Similarity
/
Accuracy
MSRP
/
Semantic Textual Similarity
/
F1
MSRVTT-CTN
/
Video Captioning
/
CIDEr
MSRVTT-CTN
/
Video Captioning
/
ROUGE-L
MSRVTT-CTN
/
Video Captioning
/
SPICE
MSRVTT-MC
/
Video Question Answering
/
Accuracy
MSRVTT-QA
/
Question Answering
/
Accuracy
MSRVTT-QA
/
Question Answering
/
Confidence Score
MSRVTT-QA
/
Video Question Answering
/
Accuracy
MSRVTT-QA
/
Video Question Answering
/
Confidence Score
MSRVTT-QA
/
Visual Question Answering
/
Accuracy
MSRVTT-QA
/
Visual Question Answering
/
Test Accuracy
MSRVTT-QA
/
Visual Question Answering (VQA)
/
Accuracy
MSRVTT-QA
/
Visual Question Answering (VQA)
/
Test Accuracy
MSRVTT-QA
/
Zero-Shot Learning
/
Accuracy