TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deep Label Distribution Learning with Label Ambiguity

Deep Label Distribution Learning with Label Ambiguity

Bin-Bin Gao, Chao Xing, Chen-Wei Xie, Jianxin Wu, Xin Geng

2016-11-06Age EstimationSemantic SegmentationPose EstimationGeneral ClassificationClassificationMulti-Label ClassificationHead Pose Estimation
PaperPDFCodeCode(official)

Abstract

Convolutional Neural Networks (ConvNets) have achieved excellent recognition performance in various visual recognition tasks. A large labeled training set is one of the most important factors for its success. However, it is difficult to collect sufficient training images with precise labels in some domains such as apparent age estimation, head pose estimation, multi-label classification and semantic segmentation. Fortunately, there is ambiguous information among labels, which makes these tasks different from traditional classification. Based on this observation, we convert the label of each image into a discrete label distribution, and learn the label distribution by minimizing a Kullback-Leibler divergence between the predicted and ground-truth label distributions using deep ConvNets. The proposed DLDL (Deep Label Distribution Learning) method effectively utilizes the label ambiguity in both feature learning and classifier learning, which help prevent the network from over-fitting even when the training set is small. Experimental results show that the proposed approach produces significantly better results than state-of-the-art methods for age estimation and head pose estimation. At the same time, it also improves recognition performance for multi-label classification and semantic segmentation tasks.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingChaLearn 2015MAE3.51DLDL+VGG-Face
Facial Recognition and ModellingChaLearn 2015e-error0.31DLDL+VGG-Face
Facial Recognition and ModellingMORPH Album2MAE2.42DLDL+VGG-Face (KL, Max)3
Semantic SegmentationPASCAL VOC 2012Mean IoU67.1DLDL-8s+CRF
Semantic SegmentationPASCAL VOC 2011Mean IoU67.6DLDL-8s+CRF
Pose EstimationPointing'04MAE4.64Ours DLDL (KL)
Pose EstimationAFLWMAE9.78DLDL (KL)
Pose EstimationBJUT-3DMAE0.09Ours DLDL (KL)
Multi-Label ClassificationPASCAL VOC 2012mAP92.4Ours PF-DLDL
Multi-Label ClassificationPASCAL VOC 2007mAP93.4Ours PF-DLDL
Face ReconstructionChaLearn 2015MAE3.51DLDL+VGG-Face
Face ReconstructionChaLearn 2015e-error0.31DLDL+VGG-Face
Face ReconstructionMORPH Album2MAE2.42DLDL+VGG-Face (KL, Max)3
3DPointing'04MAE4.64Ours DLDL (KL)
3DAFLWMAE9.78DLDL (KL)
3DBJUT-3DMAE0.09Ours DLDL (KL)
3DChaLearn 2015MAE3.51DLDL+VGG-Face
3DChaLearn 2015e-error0.31DLDL+VGG-Face
3DMORPH Album2MAE2.42DLDL+VGG-Face (KL, Max)3
3D Face ModellingChaLearn 2015MAE3.51DLDL+VGG-Face
3D Face ModellingChaLearn 2015e-error0.31DLDL+VGG-Face
3D Face ModellingMORPH Album2MAE2.42DLDL+VGG-Face (KL, Max)3
3D Face ReconstructionChaLearn 2015MAE3.51DLDL+VGG-Face
3D Face ReconstructionChaLearn 2015e-error0.31DLDL+VGG-Face
3D Face ReconstructionMORPH Album2MAE2.42DLDL+VGG-Face (KL, Max)3
10-shot image generationPASCAL VOC 2012Mean IoU67.1DLDL-8s+CRF
10-shot image generationPASCAL VOC 2011Mean IoU67.6DLDL-8s+CRF
Age EstimationChaLearn 2015MAE3.51DLDL+VGG-Face
Age EstimationChaLearn 2015e-error0.31DLDL+VGG-Face
Age EstimationMORPH Album2MAE2.42DLDL+VGG-Face (KL, Max)3
1 Image, 2*2 StitchiPointing'04MAE4.64Ours DLDL (KL)
1 Image, 2*2 StitchiAFLWMAE9.78DLDL (KL)
1 Image, 2*2 StitchiBJUT-3DMAE0.09Ours DLDL (KL)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17