TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FineGAN: Unsupervised Hierarchical Disentanglement for Fin...

FineGAN: Unsupervised Hierarchical Disentanglement for Fine-Grained Object Generation and Discovery

Krishna Kumar Singh, Utkarsh Ojha, Yong Jae Lee

2018-11-27CVPR 2019 6Fine-Grained Visual CategorizationDisentanglementImage ClusteringConditional Image Generation
PaperPDFCode(official)

Abstract

We propose FineGAN, a novel unsupervised GAN framework, which disentangles the background, object shape, and object appearance to hierarchically generate images of fine-grained object categories. To disentangle the factors without supervision, our key idea is to use information theory to associate each factor to a latent code, and to condition the relationships between the codes in a specific way to induce the desired hierarchy. Through extensive experiments, we show that FineGAN achieves the desired disentanglement to generate realistic and diverse images belonging to fine-grained classes of birds, dogs, and cars. Using FineGAN's automatically learned features, we also cluster real images as a first attempt at solving the novel problem of unsupervised fine-grained object category discovery. Our code/models/demo can be found at https://github.com/kkanshul/finegan

Results

TaskDatasetMetricValueModel
Image GenerationCUB 128 x 128FID11.25FineGAN
Image GenerationCUB 128 x 128Inception score52.53FineGAN
Image GenerationStanford CarsFID16.03FineGAN
Image GenerationStanford CarsInception score32.62FineGAN
Image GenerationStanford DogsFID25.66FineGAN
Image GenerationStanford DogsInception score46.92FineGAN
Image ClusteringStanford CarsAccuracy0.078FineGAN
Image ClusteringStanford CarsNMI0.354FineGAN
Image ClusteringStanford DogsAccuracy0.079FineGAN
Image ClusteringStanford DogsNMI0.233FineGAN
Image ClusteringCUB BirdsAccuracy0.126FineGAN
Image ClusteringCUB BirdsNMI0.403FineGAN

Related Papers

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models2025-07-18Towards Imperceptible JPEG Image Hiding: Multi-range Representations-driven Adversarial Stego Generation2025-07-11Generative Head-Mounted Camera Captures for Photorealistic Avatars2025-07-08Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering2025-07-08Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations2025-07-04Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation2025-07-04Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization2025-07-03SemFaceEdit: Semantic Face Editing on Generative Radiance Manifolds2025-06-28