Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Generalized Parametric Contrastive Learning

Jiequan Cui, Zhisheng Zhong, Zhuotao Tian, Shu Liu, Bei Yu, Jiaya Jia

2022-09-26 | Image Classification, Long-tail Learning, Domain Generalization, Semantic Segmentation, Contrastive Learning

Abstract

In this paper, we propose Generalized Parametric Contrastive Learning (GPaCo/PaCo), which works well on both imbalanced and balanced data. Based on theoretical analysis, we observe that the supervised contrastive loss tends to be biased toward high-frequency classes and thus increases the difficulty of imbalanced learning. We introduce a set of parametric, class-wise learnable centers to rebalance from an optimization perspective. Further, we analyze our GPaCo/PaCo loss under a balanced setting. Our analysis demonstrates that GPaCo/PaCo adaptively increases the intensity with which samples of the same class are pushed close together as more samples are pulled toward their corresponding centers, which benefits hard example learning. Experiments on long-tailed benchmarks establish a new state of the art for long-tailed recognition. On full ImageNet, models from CNNs to vision transformers trained with the GPaCo loss show better generalization performance and stronger robustness than MAE models. Moreover, GPaCo can be applied to semantic segmentation, where clear improvements are observed on the 4 most popular benchmarks. Our code is available at https://github.com/dvlab-research/Parametric-Contrastive-Learning.
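
The idea of adding class-wise centers to the contrast set can be illustrated with a small NumPy sketch. This is a simplified illustration, not the authors' implementation (see the linked repository for that). The function name, the weight `alpha` on sample-to-sample positives, the `temperature` value, and the per-sample normalization are all assumptions made for this example, and the centers are fixed arrays here rather than learned parameters.

```python
import numpy as np

def center_augmented_supcon_loss(features, labels, centers,
                                 alpha=0.05, temperature=0.07):
    """Supervised contrastive loss with class centers in the contrast set.

    features: (N, D) L2-normalized embeddings
    labels:   (N,) integer class ids in [0, K)
    centers:  (K, D) class-wise centers (learnable in a real model; fixed here)
    alpha:    hypothetical weight on same-class sample positives
    """
    n = features.shape[0]
    k = centers.shape[0]

    # Similarities to every sample and to every class center.
    sample_logits = features @ features.T / temperature   # (N, N)
    center_logits = features @ centers.T / temperature    # (N, K)
    logits = np.concatenate([sample_logits, center_logits], axis=1)

    # A sample is never contrasted with itself.
    valid = np.ones_like(logits, dtype=bool)
    valid[np.arange(n), np.arange(n)] = False

    # Positives: same-class samples (weight alpha) and the sample's own
    # class center (weight 1.0). All other entries act as negatives.
    same_class = (labels[:, None] == labels[None, :]) & ~np.eye(n, dtype=bool)
    own_center = np.eye(k, dtype=bool)[labels]             # (N, K)
    weights = np.concatenate([alpha * same_class, 1.0 * own_center], axis=1)

    # Numerically stable log-softmax over the valid contrast set.
    masked = np.where(valid, logits, -np.inf)
    row_max = masked.max(axis=1, keepdims=True)
    log_denom = row_max + np.log(
        np.exp(masked - row_max).sum(axis=1, keepdims=True))
    log_prob = logits - log_denom

    # Weighted average of negative log-probabilities over the positives.
    per_sample = -(weights * log_prob).sum(axis=1) / weights.sum(axis=1)
    return per_sample.mean()


# Usage: two classes, embeddings aligned with their own class centers.
centers = np.array([[1.0, 0.0], [0.0, 1.0]])
features = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
labels = np.array([0, 0, 1, 1])
loss = center_augmented_supcon_loss(features, labels, centers)
```

Because every sample always has its own class center as a positive, even a class with a single sample in the batch receives a training signal. This is one informal way to read the rebalancing effect described in the abstract.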

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Domain Adaptation | ImageNet-R | Top-1 Error Rate | 39.7 | GPaCo (ViT-L) |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 39 | GPaCo (ViT-L) |
| Domain Adaptation | ImageNet-Sketch | Top-1 Accuracy | 48.3 | GPaCo (ViT-L) |
| Semantic Segmentation | PASCAL Context | mIoU | 56.2 | GPaCo (ResNet101) |
| Semantic Segmentation | ADE20K | Validation mIoU | 54.3 | GPaCo (Swin-L) |
| Image Classification | Places-LT | Top-1 Accuracy | 41.7 | GPaCo (ResNet-152) |
| Image Classification | ImageNet-LT | Top-1 Accuracy | 63.2 | GPaCo (2-ResNeXt101-32x4d) |
| Few-Shot Image Classification | Places-LT | Top-1 Accuracy | 41.7 | GPaCo (ResNet-152) |
| Few-Shot Image Classification | ImageNet-LT | Top-1 Accuracy | 63.2 | GPaCo (2-ResNeXt101-32x4d) |
| Generalized Few-Shot Classification | Places-LT | Top-1 Accuracy | 41.7 | GPaCo (ResNet-152) |
| Generalized Few-Shot Classification | ImageNet-LT | Top-1 Accuracy | 63.2 | GPaCo (2-ResNeXt101-32x4d) |
| Long-tail Learning | Places-LT | Top-1 Accuracy | 41.7 | GPaCo (ResNet-152) |
| Long-tail Learning | ImageNet-LT | Top-1 Accuracy | 63.2 | GPaCo (2-ResNeXt101-32x4d) |
| Generalized Few-Shot Learning | Places-LT | Top-1 Accuracy | 41.7 | GPaCo (ResNet-152) |
| Generalized Few-Shot Learning | ImageNet-LT | Top-1 Accuracy | 63.2 | GPaCo (2-ResNeXt101-32x4d) |
| Domain Generalization | ImageNet-R | Top-1 Error Rate | 39.7 | GPaCo (ViT-L) |
| Domain Generalization | ImageNet-C | mean Corruption Error (mCE) | 39 | GPaCo (ViT-L) |
| Domain Generalization | ImageNet-Sketch | Top-1 Accuracy | 48.3 | GPaCo (ViT-L) |
| 10-shot image generation | PASCAL Context | mIoU | 56.2 | GPaCo (ResNet101) |
| 10-shot image generation | ADE20K | Validation mIoU | 54.3 | GPaCo (Swin-L) |

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations (2025-07-18)
Adversarial attacks to image classification systems using evolutionary algorithms (2025-07-17)
Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy (2025-07-17)
Federated Learning for Commercial Image Sources (2025-07-17)
MUPAX: Multidimensional Problem Agnostic eXplainable AI (2025-07-17)
Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization (2025-07-17)
GLAD: Generalizable Tuning for Vision-Language Models (2025-07-17)