TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unsupervised Part-Based Disentangling of Object Shape and ...

Unsupervised Part-Based Disentangling of Object Shape and Appearance

Dominik Lorenz, Leonard Bereska, Timo Milbich, Björn Ommer

2019-03-16CVPR 2019 6Unsupervised Human Pose EstimationUnsupervised Facial Landmark DetectionTranslationPose PredictionImage Generation
PaperPDFCodeCode

Abstract

Large intra-class variation is the result of changes in multiple object characteristics. Images, however, only show the superposition of different variable factors such as appearance or shape. Therefore, learning to disentangle and represent these different characteristics poses a great challenge, especially in the unsupervised case. Moreover, large object articulation calls for a flexible part-based model. We present an unsupervised approach for disentangling appearance and shape by learning parts consistently over all instances of a category. Our model for learning an object representation is trained by simultaneously exploiting invariance and equivariance constraints between synthetically transformed images. Since no part annotation or prior information on an object class is required, the approach is applicable to arbitrary classes. We evaluate our approach on a wide range of object categories and diverse tasks including pose prediction, disentangled image synthesis, and video-to-video translation. The approach outperforms the state-of-the-art on unsupervised keypoint prediction and compares favorably even against supervised approaches on the task of shape and appearance transfer.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingMAFLNME3.24Lorenz2019unsupervised
Facial Recognition and ModellingMAFL UnalignedNME11.41Lorenz2019unsupervised
Facial Landmark DetectionMAFLNME3.24Lorenz2019unsupervised
Facial Landmark DetectionMAFL UnalignedNME11.41Lorenz2019unsupervised
Face ReconstructionMAFLNME3.24Lorenz2019unsupervised
Face ReconstructionMAFL UnalignedNME11.41Lorenz2019unsupervised
3DMAFLNME3.24Lorenz2019unsupervised
3DMAFL UnalignedNME11.41Lorenz2019unsupervised
3D Face ModellingMAFLNME3.24Lorenz2019unsupervised
3D Face ModellingMAFL UnalignedNME11.41Lorenz2019unsupervised
3D Face ReconstructionMAFLNME3.24Lorenz2019unsupervised
3D Face ReconstructionMAFL UnalignedNME11.41Lorenz2019unsupervised

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16Function-to-Style Guidance of LLMs for Code Translation2025-07-15