Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video on Kinetics-600 12 frames, 64x64

Metric: Pred (higher is better)

Results

| # | Model | Pred | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Video VQ-VAE FVD | 12 | No | Predicting Video with VQVAE | 2021-03-02 | Code |
| 2 | SiD2 | 11 | No | Simpler Diffusion (SiD2): 1.5 FID on ImageNet512... | 2024-10-25 | - |
| 3 | LARP | 11 | No | LARP: Tokenizing Videos with a Learned Autoregre... | 2024-10-28 | Code |
| 4 | MAGVIT (-L-FP) | 11 | No | MAGVIT: Masked Generative Video Transformer | 2022-12-10 | Code |
| 5 | RaMViD | 11 | No | Diffusion Models for Video Prediction and Infill... | 2022-06-15 | Code |
| 6 | MAGVIT (-B-FP) | 11 | No | MAGVIT: Masked Generative Video Transformer | 2022-12-10 | Code |
| 7 | TriVD-GAN-FP | 11 | No | Transformation-based Adversarial Video Predictio... | 2020-03-09 | - |
| 8 | CCVS | 11 | No | CCVS: Context-aware Controllable Video Synthesis | 2021-07-16 | Code |
| 9 | DVD-GAN-FP | 11 | No | Adversarial Video Generation on Complex Datasets | 2019-07-15 | Code |
| 10 | Video Transformer | 11 | No | Scaling Autoregressive Video Models | 2019-06-06 | Code |
| 11 | LVT | 11 | No | Latent Video Transformer | 2020-06-18 | Code |