TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Stochastic Latent Residual Video Prediction

Stochastic Latent Residual Video Prediction

Jean-Yves Franceschi, Edouard Delasalles, Mickaël Chen, Sylvain Lamprier, Patrick Gallinari

2020-02-21ICML 2020 1Video PredictionPredictionVideo Generation
PaperPDFCode(official)

Abstract

Designing video prediction models that account for the inherent uncertainty of the future is challenging. Most works in the literature are based on stochastic image-autoregressive recurrent networks, which raises several performance and applicability issues. An alternative is to use fully latent temporal models which untie frame synthesis and temporal dynamics. However, no such model for stochastic video prediction has been proposed in the literature yet, due to design and training difficulties. In this paper, we overcome these difficulties by introducing a novel stochastic temporal model whose dynamics are governed in a latent space by a residual update rule. This first-order scheme is motivated by discretization schemes of differential equations. It naturally models video dynamics as it allows our simpler, more interpretable, latent model to outperform prior state-of-the-art methods on challenging datasets.

Results

TaskDatasetMetricValueModel
VideoBAIR Robot PushingCond2SRVP
VideoBAIR Robot PushingPred28SRVP
VideoBAIR Robot PushingTrain12SRVP
VideoKTHCond10SRVP
VideoKTHPred30SRVP
VideoKTHTrain10SRVP
VideoKTH 64x64 cond10 pred30FVD222SRVP
VideoCityscapes 128x128Cond.10SRVP
VideoCityscapes 128x128Pred20SRVP
Video PredictionKTHCond10SRVP
Video PredictionKTHPred30SRVP
Video PredictionKTHTrain10SRVP
Video PredictionKTH 64x64 cond10 pred30FVD222SRVP
Video PredictionCityscapes 128x128Cond.10SRVP
Video PredictionCityscapes 128x128Pred20SRVP
Video GenerationBAIR Robot PushingCond2SRVP
Video GenerationBAIR Robot PushingPred28SRVP
Video GenerationBAIR Robot PushingTrain12SRVP

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Leveraging Pre-Trained Visual Models for AI-Generated Video Detection2025-07-17Taming Diffusion Transformer for Real-Time Mobile Video Generation2025-07-17LoViC: Efficient Long Video Generation with Context Compression2025-07-17Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15$I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting2025-07-12Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11