Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Unsupervised Part Segmentation through Disentangling Appearance and Shape

Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu

2021-05-26 | CVPR 2021
Tasks: Disentanglement, Unsupervised Facial Landmark Detection, Segmentation, Semantic Segmentation

Abstract

We study the problem of unsupervised discovery and segmentation of object parts, which, as an intermediate local representation, are capable of finding intrinsic object structure and providing more explainable recognition results. Recent unsupervised methods have greatly relaxed the dependency on annotated data, which are costly to obtain, but still rely on additional information such as object segmentation masks or saliency maps. To remove such a dependency and further improve part segmentation performance, we develop a novel approach that disentangles the appearance and shape representations of object parts and is trained with reconstruction losses, without using additional object mask information. To avoid degenerate solutions, a bottleneck block is designed to squeeze and expand the appearance representation, leading to a more effective disentanglement between geometry and appearance. Combined with a self-supervised part classification loss and an improved geometry concentration constraint, the method segments more consistent parts with semantic meanings. Comprehensive experiments on a wide variety of objects, such as faces, birds, and PASCAL VOC objects, demonstrate the effectiveness of the proposed method.
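
The abstract mentions a geometry concentration constraint but does not define it. As a rough illustration only, the sketch below computes a basic concentration term of the kind used in earlier unsupervised part segmentation work: for each part, the response-weighted squared distance of pixels from that part's center of mass. This is an assumed baseline form, not the paper's improved constraint, and the function name `concentration_loss` and the nested-list input layout are hypothetical.

```python
def concentration_loss(part_maps):
    """Basic geometry concentration term (assumed form, not the paper's improved one).

    part_maps: list of K part response maps, each an H x W nested list of
    non-negative floats. Lower values mean each part is spatially compact.
    """
    total = 0.0
    for response in part_maps:
        # Total response mass of this part.
        z = sum(sum(row) for row in response)
        if z == 0:
            continue  # an empty part contributes nothing
        # Response-weighted center of mass (u = column, v = row).
        cu = sum(u * r for row in response for u, r in enumerate(row)) / z
        cv = sum(v * r for v, row in enumerate(response) for r in row) / z
        # Response-weighted squared distance from the center.
        total += sum(
            ((u - cu) ** 2 + (v - cv) ** 2) * r
            for v, row in enumerate(response)
            for u, r in enumerate(row)
        ) / z
    return total


# A part concentrated on one pixel is perfectly compact.
compact = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
# A part split between two pixels two columns apart is less compact.
spread = [[1.0, 0.0, 1.0]]
print(concentration_loss([compact]), concentration_loss([spread]))
```

Minimizing a term like this pushes each part's responses toward a single compact region, which is one way to discourage parts that scatter across the image.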

Results

The same five results are listed under each of these tasks: Facial Recognition and Modelling, Facial Landmark Detection, Face Reconstruction, 3D, 3D Face Modelling, 3D Face Reconstruction.

Dataset         Metric  Value  Model
MAFL Unaligned  NME     12.26  UPSDAS
AFLW Unaligned  NME     13.13  UPSDAP
AFLW Unaligned  NME     13.31  IMM
AFLW Unaligned  NME     13.6   Lorenz2019unsupervised
AFLW Unaligned  NME     16.05  SCOPS
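
The results above are reported as NME (normalized mean error), where lower is better. The page does not say which normalizing distance the benchmark uses, so the sketch below assumes the common inter-ocular convention and reports the value as a percentage. The function names and the eye landmark indices are hypothetical.

```python
import math


def nme(pred, gt, norm_dist):
    """Mean Euclidean landmark error divided by norm_dist, as a percentage."""
    if not gt or len(pred) != len(gt):
        raise ValueError("pred and gt must be non-empty and the same length")
    if norm_dist <= 0:
        raise ValueError("norm_dist must be positive")
    mean_err = sum(math.dist(p, g) for p, g in zip(pred, gt)) / len(gt)
    return 100.0 * mean_err / norm_dist


def interocular(gt, left_eye=0, right_eye=1):
    """Distance between the two eye landmarks (indices are an assumption)."""
    return math.dist(gt[left_eye], gt[right_eye])


# Five ground-truth landmarks; every prediction is off by (3, 4), i.e. 5 pixels.
gt = [(30.0, 40.0), (70.0, 40.0), (50.0, 60.0), (35.0, 80.0), (65.0, 80.0)]
pred = [(x + 3.0, y + 4.0) for x, y in gt]
score = nme(pred, gt, interocular(gt))
print(score)  # 5 px mean error over a 40 px inter-ocular distance: 12.5
```

On a real benchmark the score is averaged over all test images, and unsupervised methods typically map their discovered parts or landmarks to the annotated ones with a learned regressor before this error is measured.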

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models (2025-07-18)
Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction (2025-07-17)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation (2025-07-17)
Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)