TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/First Order Motion Model for Image Animation

First Order Motion Model for Image Animation

Aliaksandr Siarohin, Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci, Nicu Sebe

2020-02-29NeurIPS 2019 12Video Reconstruction
PaperPDFCode(official)CodeCode

Abstract

Image animation consists of generating a video sequence so that an object in a source image is animated according to the motion of a driving video. Our framework addresses this problem without using any annotation or prior information about the specific object to animate. Once trained on a set of videos depicting objects of the same category (e.g. faces, human bodies), our method can be applied to any object of this class. To achieve this, we decouple appearance and motion information using a self-supervised formulation. To support complex motions, we use a representation consisting of a set of learned keypoints along with their local affine transformations. A generator network models occlusions arising during target motions and combines the appearance extracted from the source image and the motion derived from the driving video. Our framework scores best on diverse benchmarks and on a variety of object categories. Our source code is publicly available.

Results

TaskDatasetMetricValueModel
3DTai-Chi-HDL10.063First Order Motion
Video ReconstructionTai-Chi-HDL10.063First Order Motion

Related Papers

GSVR: 2D Gaussian-based Video Representation for 800+ FPS with Hybrid Deformation Field2025-07-08Quanta Diffusion2025-06-07Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation2025-06-04Compressing Human Body Video with Interactive Semantics: A Generative Approach2025-05-22Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction2025-05-22V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation2025-05-22Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space2025-05-22Few-shot Semantic Encoding and Decoding for Video Surveillance2025-05-12