Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Text-to-Video Generation on MSR-VTT

Metric: FID (lower is better)
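
For readers unfamiliar with the metric: FID (Fréchet Inception Distance) is the Fréchet distance between two Gaussians fitted to feature embeddings of real and generated samples, ||mu_r - mu_g||^2 + Tr(S_r + S_g - 2 (S_r S_g)^(1/2)). The sketch below is a simplified illustration only. It assumes diagonal covariances, in which case the trace term reduces to a sum of squared differences of per-dimension standard deviations. The function name and the toy feature values are hypothetical, and this page does not state how features were extracted for the listed results, so this is not the evaluation code behind the leaderboard.

```python
import math
from statistics import fmean, variance


def frechet_distance_diag(real_feats, gen_feats):
    """Fréchet distance between two Gaussians fitted to feature rows.

    Simplifying assumption (not part of standard FID): covariances are
    treated as diagonal, so each feature dimension is handled on its own.
    Each argument is a list of equal-length feature vectors (>= 2 rows).
    """
    dims = len(real_feats[0])
    total = 0.0
    for d in range(dims):
        real_col = [row[d] for row in real_feats]
        gen_col = [row[d] for row in gen_feats]
        # Squared difference of the per-dimension means.
        mean_term = (fmean(real_col) - fmean(gen_col)) ** 2
        # With diagonal covariances, Tr(S_r + S_g - 2 (S_r S_g)^(1/2))
        # becomes the sum over dimensions of (sd_r - sd_g)^2.
        sd_real = math.sqrt(variance(real_col))
        sd_gen = math.sqrt(variance(gen_col))
        total += mean_term + (sd_real - sd_gen) ** 2
    return total


# Toy, made-up features: same spread in both sets, means differ in dimension 0.
real = [[0.0, 1.0], [2.0, 3.0]]
generated = [[1.0, 1.0], [3.0, 3.0]]
score = frechet_distance_diag(real, generated)
```

Identical feature sets give a distance of 0, and the value grows as the two distributions drift apart, which is why lower scores rank higher in the table.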


Results

| # | Model | FID | Extra Data | Paper | Date | Code |
|---|-------|-----|------------|-------|------|------|
| 1 | TF-T2V | 8.19 | No | A Recipe for Scaling up Text-to-Video Generation... | 2023-12-25 | Code |
| 2 | HiGen | 8.6 | No | Hierarchical Spatio-temporal Decoupling for Text... | 2023-12-07 | Code |
| 3 | ModelScopeT2V | 11.09 | No | ModelScope Text-to-Video Technical Report | 2023-08-12 | Code |
| 4 | Video-LaVIT | 11.27 | No | Video-LaVIT: Unified Video-Language Pre-training... | 2024-02-05 | Code |
| 5 | Show-1 | 13.08 | No | Show-1: Marrying Pixel and Latent Diffusion Mode... | 2023-09-27 | Code |
| 6 | Make-A-Video | 13.17 | No | Make-A-Video: Text-to-Video Generation without T... | 2022-09-29 | Code |
| 7 | MMVG | 23.4 | No | Tell Me What Happened: Unifying Text-guided Vide... | 2022-11-23 | Code |
| 8 | CogVideo (English) | 23.59 | No | Make-A-Video: Text-to-Video Generation without T... | 2022-09-29 | Code |
| 9 | MagicVideo | 36.5 | No | MagicVideo: Efficient Video Generation With Late... | 2022-11-20 | - |
| 10 | NUWA | 47.68 | No | NÜWA: Visual Synthesis Pre-training for Neural v... | 2021-11-24 | Code |