TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Motion Representations for Articulated Animation

Motion Representations for Articulated Animation

Aliaksandr Siarohin, Oliver J. Woodford, Jian Ren, Menglei Chai, Sergey Tulyakov

2021-04-22CVPR 2021 1Video Reconstruction
PaperPDFCodeCode(official)

Abstract

We propose novel motion representations for animating articulated objects consisting of distinct parts. In a completely unsupervised manner, our method identifies object parts, tracks them in a driving video, and infers their motions by considering their principal axes. In contrast to the previous keypoint-based works, our method extracts meaningful and consistent regions, describing locations, shape, and pose. The regions correspond to semantically relevant and distinct object parts, that are more easily detected in frames of the driving video. To force decoupling of foreground from background, we model non-object related global motion with an additional affine transformation. To facilitate animation and prevent the leakage of the shape of the driving object, we disentangle shape and pose of objects in the region space. Our model can animate a variety of objects, surpassing previous methods by a large margin on existing benchmarks. We present a challenging new benchmark with high-resolution videos and show that the improvement is particularly pronounced when articulated objects are considered, reaching 96.6% user preference vs. the state of the art.

Results

TaskDatasetMetricValueModel
3DTai-Chi-HD (256)AED0.152Siarohin et al.
3DTai-Chi-HD (256)AKD5.58Siarohin et al.
3DTai-Chi-HD (256)L10.047Siarohin et al.
3DTai-Chi-HD (256)MKR0.027Siarohin et al.
3DTai-Chi-HD (256)AED0.172FOMM
3DTai-Chi-HD (256)AKD6.53FOMM
3DTai-Chi-HD (256)L10.056FOMM
3DTai-Chi-HD (256)MKR0.033FOMM
3DVoxCelebAED0.133Siarohin et al.
3DVoxCelebAKD1.28Siarohin et al.
3DVoxCelebL10.04Siarohin et al.
3DVoxCelebAED0.134FOMM
3DVoxCelebAKD1.27FOMM
3DVoxCelebL10.041FOMM
3DTai-Chi-HD (512)AED0.172Siarohin et al.
3DTai-Chi-HD (512)AKD13.86Siarohin et al.
3DTai-Chi-HD (512)L10.064Siarohin et al.
3DTai-Chi-HD (512)MKR0.043Siarohin et al.
3DTai-Chi-HD (512)AED0.203FOMM
3DTai-Chi-HD (512)AKD17.12FOMM
3DTai-Chi-HD (512)L10.075FOMM
3DTai-Chi-HD (512)MKR0.066FOMM
3DMGifL10.0206Siarohin et al.
3DMGifL10.0223FOMM
3DTED-talksAED0.114Siarohin et al.
3DTED-talksAKD3.75Siarohin et al.
3DTED-talksL10.026Siarohin et al.
3DTED-talksMKR0.007Siarohin et al.
3DTED-talksAED0.163FOMM
3DTED-talksAKD7.07FOMM
3DTED-talksL10.033FOMM
3DTED-talksMKR0.014FOMM
Video ReconstructionTai-Chi-HD (256)AED0.152Siarohin et al.
Video ReconstructionTai-Chi-HD (256)AKD5.58Siarohin et al.
Video ReconstructionTai-Chi-HD (256)L10.047Siarohin et al.
Video ReconstructionTai-Chi-HD (256)MKR0.027Siarohin et al.
Video ReconstructionTai-Chi-HD (256)AED0.172FOMM
Video ReconstructionTai-Chi-HD (256)AKD6.53FOMM
Video ReconstructionTai-Chi-HD (256)L10.056FOMM
Video ReconstructionTai-Chi-HD (256)MKR0.033FOMM
Video ReconstructionVoxCelebAED0.133Siarohin et al.
Video ReconstructionVoxCelebAKD1.28Siarohin et al.
Video ReconstructionVoxCelebL10.04Siarohin et al.
Video ReconstructionVoxCelebAED0.134FOMM
Video ReconstructionVoxCelebAKD1.27FOMM
Video ReconstructionVoxCelebL10.041FOMM
Video ReconstructionTai-Chi-HD (512)AED0.172Siarohin et al.
Video ReconstructionTai-Chi-HD (512)AKD13.86Siarohin et al.
Video ReconstructionTai-Chi-HD (512)L10.064Siarohin et al.
Video ReconstructionTai-Chi-HD (512)MKR0.043Siarohin et al.
Video ReconstructionTai-Chi-HD (512)AED0.203FOMM
Video ReconstructionTai-Chi-HD (512)AKD17.12FOMM
Video ReconstructionTai-Chi-HD (512)L10.075FOMM
Video ReconstructionTai-Chi-HD (512)MKR0.066FOMM
Video ReconstructionMGifL10.0206Siarohin et al.
Video ReconstructionMGifL10.0223FOMM
Video ReconstructionTED-talksAED0.114Siarohin et al.
Video ReconstructionTED-talksAKD3.75Siarohin et al.
Video ReconstructionTED-talksL10.026Siarohin et al.
Video ReconstructionTED-talksMKR0.007Siarohin et al.
Video ReconstructionTED-talksAED0.163FOMM
Video ReconstructionTED-talksAKD7.07FOMM
Video ReconstructionTED-talksL10.033FOMM
Video ReconstructionTED-talksMKR0.014FOMM

Related Papers

GSVR: 2D Gaussian-based Video Representation for 800+ FPS with Hybrid Deformation Field2025-07-08Quanta Diffusion2025-06-07Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation2025-06-04Compressing Human Body Video with Interactive Semantics: A Generative Approach2025-05-22Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction2025-05-22V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation2025-05-22Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space2025-05-22Few-shot Semantic Encoding and Decoding for Video Surveillance2025-05-12