TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Large-Scale Long-Tailed Recognition in an Open World

Large-Scale Long-Tailed Recognition in an Open World

Ziwei Liu, Zhongqi Miao, Xiaohang Zhan, Jiayun Wang, Boqing Gong, Stella X. Yu

2019-04-10CVPR 2019 6Few-Shot LearningOpen Set LearningLong-tail LearningLong-tail learning with class descriptorsGeneral ClassificationClassificationimbalanced classification
PaperPDFCodeCode

Abstract

Real world data often have a long-tailed and open-ended distribution. A practical recognition system must classify among majority and minority classes, generalize from a few known instances, and acknowledge novelty upon a never seen instance. We define Open Long-Tailed Recognition (OLTR) as learning from such naturally distributed data and optimizing the classification accuracy over a balanced test set which include head, tail, and open classes. OLTR must handle imbalanced classification, few-shot learning, and open-set recognition in one integrated algorithm, whereas existing classification approaches focus only on one aspect and deliver poorly over the entire class spectrum. The key challenges are how to share visual knowledge between head and tail classes and how to reduce confusion between tail and open classes. We develop an integrated OLTR algorithm that maps an image to a feature space such that visual concepts can easily relate to each other based on a learned metric that respects the closed-world classification while acknowledging the novelty of the open world. Our so-called dynamic meta-embedding combines a direct image feature and an associated memory feature, with the feature norm indicating the familiarity to known classes. On three large-scale OLTR datasets we curate from object-centric ImageNet, scene-centric Places, and face-centric MS1M data, our method consistently outperforms the state-of-the-art. Our code, datasets, and models enable future OLTR research and are publicly available at https://liuziwei7.github.io/projects/LongTail.html.

Results

TaskDatasetMetricValueModel
Image ClassificationPlaces-LTTop-1 Accuracy34.1OLTR
Image ClassificationImageNet-LTTop-1 Accuracy35.6OLTR
Image ClassificationCOCO-MLTAverage mAP45.83OLTR(ResNet-50)
Image ClassificationVOC-MLTAverage mAP71.02OLTR(ResNet-50)
Image ClassificationImageNet-LT-dPer-Class Accuracy37.7OLTR
Few-Shot Image ClassificationPlaces-LTTop-1 Accuracy34.1OLTR
Few-Shot Image ClassificationImageNet-LTTop-1 Accuracy35.6OLTR
Few-Shot Image ClassificationCOCO-MLTAverage mAP45.83OLTR(ResNet-50)
Few-Shot Image ClassificationVOC-MLTAverage mAP71.02OLTR(ResNet-50)
Few-Shot Image ClassificationImageNet-LT-dPer-Class Accuracy37.7OLTR
Generalized Few-Shot ClassificationPlaces-LTTop-1 Accuracy34.1OLTR
Generalized Few-Shot ClassificationImageNet-LTTop-1 Accuracy35.6OLTR
Generalized Few-Shot ClassificationCOCO-MLTAverage mAP45.83OLTR(ResNet-50)
Generalized Few-Shot ClassificationVOC-MLTAverage mAP71.02OLTR(ResNet-50)
Generalized Few-Shot ClassificationImageNet-LT-dPer-Class Accuracy37.7OLTR
Long-tail LearningPlaces-LTTop-1 Accuracy34.1OLTR
Long-tail LearningImageNet-LTTop-1 Accuracy35.6OLTR
Long-tail LearningCOCO-MLTAverage mAP45.83OLTR(ResNet-50)
Long-tail LearningVOC-MLTAverage mAP71.02OLTR(ResNet-50)
Long-tail LearningImageNet-LT-dPer-Class Accuracy37.7OLTR
Generalized Few-Shot LearningPlaces-LTTop-1 Accuracy34.1OLTR
Generalized Few-Shot LearningImageNet-LTTop-1 Accuracy35.6OLTR
Generalized Few-Shot LearningCOCO-MLTAverage mAP45.83OLTR(ResNet-50)
Generalized Few-Shot LearningVOC-MLTAverage mAP71.02OLTR(ResNet-50)
Generalized Few-Shot LearningImageNet-LT-dPer-Class Accuracy37.7OLTR

Related Papers

GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Safeguarding Federated Learning-based Road Condition Classification2025-07-16AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs)2025-07-13Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection2025-07-10An Enhanced Privacy-preserving Federated Few-shot Learning Framework for Respiratory Disease Diagnosis2025-07-10Few-Shot Learning by Explicit Physics Integration: An Application to Groundwater Heat Transport2025-07-08