TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LatentKeypointGAN: Controlling Images via Latent Keypoints

LatentKeypointGAN: Controlling Images via Latent Keypoints

Xingzhe He, Bastian Wandt, Helge Rhodin

2021-03-29Unsupervised Keypoint EstimationUnsupervised Human Pose EstimationUnsupervised Facial Landmark DetectionKeypoint DetectionImage Quality AssessmentImage Generation
PaperPDFCode

Abstract

Generative adversarial networks (GANs) have attained photo-realistic quality in image generation. However, how to best control the image content remains an open challenge. We introduce LatentKeypointGAN, a two-stage GAN which is trained end-to-end on the classical GAN objective with internal conditioning on a set of space keypoints. These keypoints have associated appearance embeddings that respectively control the position and style of the generated objects and their parts. A major difficulty that we address with suitable network architectures and training schemes is disentangling the image into spatial and appearance factors without domain knowledge and supervision signals. We demonstrate that LatentKeypointGAN provides an interpretable latent space that can be used to re-arrange the generated images by re-positioning and exchanging keypoint embeddings, such as generating portraits by combining the eyes, nose, and mouth from different images. In addition, the explicit generation of keypoints and matching images enables a new, GAN-based method for unsupervised keypoint detection.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingMAFLNME5.85LatentKeypointGAN
Facial Landmark DetectionMAFLNME5.85LatentKeypointGAN
Face ReconstructionMAFLNME5.85LatentKeypointGAN
3DMAFLNME5.85LatentKeypointGAN
3D Face ModellingMAFLNME5.85LatentKeypointGAN
3D Face ReconstructionMAFLNME5.85LatentKeypointGAN

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16