TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SimVP: Simpler yet Better Video Prediction

SimVP: Simpler yet Better Video Prediction

Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

2022-06-09CVPR 2022 1Video PredictionPrediction
PaperPDFCode(official)CodeCode

Abstract

From CNN, RNN, to ViT, we have witnessed remarkable advancements in video prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies. We admire these progresses but are confused about the necessity: is there a simple method that can perform comparably well? This paper proposes SimVP, a simple video prediction model that is completely built upon CNN and trained by MSE loss in an end-to-end fashion. Without introducing any additional tricks and complicated strategies, we can achieve state-of-the-art performance on five benchmark datasets. Through extended experiments, we demonstrate that SimVP has strong generalization and extensibility on real-world datasets. The significant reduction of training cost makes it easier to scale to complex scenarios. We believe SimVP can serve as a solid baseline to stimulate the further development of video prediction. The code is available at \href{https://github.com/gaozhangyang/SimVP-Simpler-yet-Better-Video-Prediction}{Github}.

Results

TaskDatasetMetricValueModel
VideoMoving MNISTMSE23.8SimVP
VideoMoving MNISTSSIM0.948SimVP
VideoHuman3.6MMAE1510SimVP
VideoHuman3.6MMSE316SimVP
VideoHuman3.6MSSIM0.904SimVP
Video PredictionMoving MNISTMSE23.8SimVP
Video PredictionMoving MNISTSSIM0.948SimVP
Video PredictionHuman3.6MMAE1510SimVP
Video PredictionHuman3.6MMSE316SimVP
Video PredictionHuman3.6MSSIM0.904SimVP

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11Foundation models for time series forecasting: Application in conformal prediction2025-07-09Predicting Graph Structure via Adapted Flux Balance Analysis2025-07-08Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis2025-07-08A Wireless Foundation Model for Multi-Task Prediction2025-07-08High Order Collaboration-Oriented Federated Graph Neural Network for Accurate QoS Prediction2025-07-07