Large-Scale Long-Tailed Recognition in an Open World

Ziwei Liu, Zhongqi Miao, Xiaohang Zhan, Jiayun Wang, Boqing Gong, Stella X. Yu

2019-04-10 · CVPR 2019

Tasks: Few-Shot Learning, Open Set Learning, Long-tail Learning, Long-tail learning with class descriptors, General Classification, Classification, imbalanced classification

Abstract

Real-world data often have a long-tailed and open-ended distribution. A practical recognition system must classify among majority and minority classes, generalize from a few known instances, and acknowledge novelty when it encounters a never-seen instance. We define Open Long-Tailed Recognition (OLTR) as learning from such naturally distributed data and optimizing the classification accuracy over a balanced test set that includes head, tail, and open classes. OLTR must handle imbalanced classification, few-shot learning, and open-set recognition in one integrated algorithm, whereas existing classification approaches focus on only one aspect and perform poorly over the entire class spectrum. The key challenges are how to share visual knowledge between head and tail classes and how to reduce confusion between tail and open classes. We develop an integrated OLTR algorithm that maps an image to a feature space such that visual concepts can easily relate to each other based on a learned metric that respects the closed-world classification while acknowledging the novelty of the open world. Our so-called dynamic meta-embedding combines a direct image feature and an associated memory feature, with the feature norm indicating the familiarity to known classes. On three large-scale OLTR datasets we curate from object-centric ImageNet, scene-centric Places, and face-centric MS1M data, our method consistently outperforms the state-of-the-art. Our code, datasets, and models enable future OLTR research and are publicly available at https://liuziwei7.github.io/projects/LongTail.html.
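The idea of combining a direct feature with a memory feature, and letting the feature norm signal familiarity, can be illustrated with a small sketch. The abstract does not give the formulas, so everything concrete here is an assumption made for illustration: the memory is taken to be a list of per-class centroids, the memory feature is a softmax-weighted average of those centroids (weights from negative Euclidean distance), and the combined feature is divided by the distance to the nearest centroid. The function names `meta_embedding` and `norm` are hypothetical and are not taken from the released code.

```python
import math


def l2_distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def norm(v):
    """L2 norm of a vector."""
    return math.sqrt(sum(x * x for x in v))


def meta_embedding(direct, centroids):
    """Illustrative combination of a direct feature and a memory feature.

    Assumptions (not stated in the abstract): the memory holds one centroid
    per known class, attention is a softmax over negative distances, and
    familiarity scaling divides by the distance to the nearest centroid.
    """
    dists = [l2_distance(direct, c) for c in centroids]
    # Closer centroids receive larger attention weights.
    weights = softmax([-d for d in dists])
    dim = len(direct)
    # Memory feature: attention-weighted average of the class centroids.
    memory = [sum(w * c[i] for w, c in zip(weights, centroids)) for i in range(dim)]
    # Small nearest-centroid distance (familiar input) yields a large norm.
    gamma = min(dists) + 1e-8
    return [(d + m) / gamma for d, m in zip(direct, memory)]


if __name__ == "__main__":
    centroids = [[1.0, 0.0], [0.0, 1.0]]             # toy memory for two known classes
    familiar = meta_embedding([0.9, 0.1], centroids)  # near a known class
    novel = meta_embedding([5.0, 5.0], centroids)     # far from every known class
    print("familiar norm:", norm(familiar))
    print("novel norm:", norm(novel))
```

In this toy setup the input close to a centroid ends up with a much larger norm than the distant one, which is the property an open-set rule could threshold on. How the actual method computes the memory feature and the scaling is defined in the paper and code, not here.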

Results

Task	Dataset	Metric	Value	Model
Image Classification	Places-LT	Top-1 Accuracy	34.1	OLTR
Image Classification	ImageNet-LT	Top-1 Accuracy	35.6	OLTR
Image Classification	COCO-MLT	Average mAP	45.83	OLTR(ResNet-50)
Image Classification	VOC-MLT	Average mAP	71.02	OLTR(ResNet-50)
Image Classification	ImageNet-LT-d	Per-Class Accuracy	37.7	OLTR
Few-Shot Image Classification	Places-LT	Top-1 Accuracy	34.1	OLTR
Few-Shot Image Classification	ImageNet-LT	Top-1 Accuracy	35.6	OLTR
Few-Shot Image Classification	COCO-MLT	Average mAP	45.83	OLTR(ResNet-50)
Few-Shot Image Classification	VOC-MLT	Average mAP	71.02	OLTR(ResNet-50)
Few-Shot Image Classification	ImageNet-LT-d	Per-Class Accuracy	37.7	OLTR
Generalized Few-Shot Classification	Places-LT	Top-1 Accuracy	34.1	OLTR
Generalized Few-Shot Classification	ImageNet-LT	Top-1 Accuracy	35.6	OLTR
Generalized Few-Shot Classification	COCO-MLT	Average mAP	45.83	OLTR(ResNet-50)
Generalized Few-Shot Classification	VOC-MLT	Average mAP	71.02	OLTR(ResNet-50)
Generalized Few-Shot Classification	ImageNet-LT-d	Per-Class Accuracy	37.7	OLTR
Long-tail Learning	Places-LT	Top-1 Accuracy	34.1	OLTR
Long-tail Learning	ImageNet-LT	Top-1 Accuracy	35.6	OLTR
Long-tail Learning	COCO-MLT	Average mAP	45.83	OLTR(ResNet-50)
Long-tail Learning	VOC-MLT	Average mAP	71.02	OLTR(ResNet-50)
Long-tail Learning	ImageNet-LT-d	Per-Class Accuracy	37.7	OLTR
Generalized Few-Shot Learning	Places-LT	Top-1 Accuracy	34.1	OLTR
Generalized Few-Shot Learning	ImageNet-LT	Top-1 Accuracy	35.6	OLTR
Generalized Few-Shot Learning	COCO-MLT	Average mAP	45.83	OLTR(ResNet-50)
Generalized Few-Shot Learning	VOC-MLT	Average mAP	71.02	OLTR(ResNet-50)
Generalized Few-Shot Learning	ImageNet-LT-d	Per-Class Accuracy	37.7	OLTR
