TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unsupervised Learning of Object Landmarks through Conditio...

Unsupervised Learning of Object Landmarks through Conditional Image Generation

Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi

2018-06-20NeurIPS 2018 12Unsupervised Facial Landmark DetectionImage GenerationConditional Image Generation
PaperPDFCodeCode

Abstract

We propose a method for learning landmark detectors for visual objects (such as the eyes and the nose in a face) without any manual supervision. We cast this as the problem of generating images that combine the appearance of the object as seen in a first example image with the geometry of the object as seen in a second example image, where the two examples differ by a viewpoint change and/or an object deformation. In order to factorize appearance and geometry, we introduce a tight bottleneck in the geometry-extraction process that selects and distils geometry-related features. Compared to standard image generation problems, which often use generative adversarial networks, our generation task is conditioned on both appearance and geometry and thus is significantly less ambiguous, to the point that adopting a simple perceptual loss formulation is sufficient. We demonstrate that our approach can learn object landmarks from synthetic image deformations or videos, all without manual supervision, while outperforming state-of-the-art unsupervised landmark detectors. We further show that our method is applicable to a large variety of datasets - faces, people, 3D objects, and digits - without any modifications.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingMAFLNME2.54Conditional Image Generation
Facial Recognition and ModellingMAFL UnalignedNME8.74IMM
Facial Recognition and ModellingAFLW (Zhang CVPR 2018 crops)NME6.31Conditional Image Generation
Facial Landmark DetectionMAFLNME2.54Conditional Image Generation
Facial Landmark DetectionMAFL UnalignedNME8.74IMM
Facial Landmark DetectionAFLW (Zhang CVPR 2018 crops)NME6.31Conditional Image Generation
Face ReconstructionMAFLNME2.54Conditional Image Generation
Face ReconstructionMAFL UnalignedNME8.74IMM
Face ReconstructionAFLW (Zhang CVPR 2018 crops)NME6.31Conditional Image Generation
3DMAFLNME2.54Conditional Image Generation
3DMAFL UnalignedNME8.74IMM
3DAFLW (Zhang CVPR 2018 crops)NME6.31Conditional Image Generation
3D Face ModellingMAFLNME2.54Conditional Image Generation
3D Face ModellingMAFL UnalignedNME8.74IMM
3D Face ModellingAFLW (Zhang CVPR 2018 crops)NME6.31Conditional Image Generation
3D Face ReconstructionMAFLNME2.54Conditional Image Generation
3D Face ReconstructionMAFL UnalignedNME8.74IMM
3D Face ReconstructionAFLW (Zhang CVPR 2018 crops)NME6.31Conditional Image Generation

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15CATVis: Context-Aware Thought Visualization2025-07-15