Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition

Bin-Bin Gao, Hong-Yu Zhou

2020-07-03 | Multi-Label Image Classification | Multi-Label Classification

Abstract

Multi-label image recognition is a practical and challenging task compared to single-label image classification. However, previous works may be suboptimal because of a great number of object proposals or complex attentional region generation modules. In this paper, we propose a simple but efficient two-stream framework to recognize multi-category objects from global image to local regions, similar to how human beings perceive objects. To bridge the gap between global and local streams, we propose a multi-class attentional region module which aims to make the number of attentional regions as small as possible and keep the diversity of these regions as high as possible. Our method can efficiently and effectively recognize multi-class objects with an affordable computation cost and a parameter-free region localization module. Over three benchmarks on multi-label image classification, we create new state-of-the-art results with a single model only using image semantics without label dependency. In addition, the effectiveness of the proposed method is extensively demonstrated under different factors such as global pooling strategy, input size and network architecture. Code has been made available at https://github.com/gaobb/MCAR.

Results

Task | Dataset | Metric | Value | Model
Multi-Label Classification | PASCAL VOC 2012 | mAP | 94.3 | MCAR (ResNet101, 448x448)
Multi-Label Classification | MS-COCO | mAP | 84.5 | MCAR (ResNet101, 576x576)
Multi-Label Classification | MS-COCO | mAP | 83.8 | MCAR (ResNet101, 448x448)
Multi-Label Classification | PASCAL VOC 2007 | mAP | 94.8 | MCAR (ResNet101, 448x448)
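
The results above are reported as mAP (mean average precision, in percent). As a rough illustration of what that metric measures in the multi-label setting, the sketch below computes a non-interpolated average precision per class and averages it over classes. This is not the evaluation code from the MCAR repository: the function names and the toy scores are hypothetical, and the official benchmark protocols may use a different AP variant (for example, an interpolated one), so values from this sketch are not directly comparable to the table.

```python
def average_precision(scores, labels):
    """Non-interpolated AP for one class.

    scores: predicted confidences, one per image.
    labels: 1 if the class is present in the image, else 0.
    """
    # Rank images from most to least confident.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precisions = []
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            # Precision measured at the rank of each positive image.
            precisions.append(hits / rank)
    # By convention here, a class with no positives contributes 0.
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(score_matrix, label_matrix):
    """Mean of per-class AP, in percent.

    Both arguments are lists of rows (one row per image, one column per class).
    """
    num_classes = len(score_matrix[0])
    aps = []
    for c in range(num_classes):
        class_scores = [row[c] for row in score_matrix]
        class_labels = [row[c] for row in label_matrix]
        aps.append(average_precision(class_scores, class_labels))
    return 100.0 * sum(aps) / len(aps)


# Toy example (made-up numbers): 4 images, 2 classes.
toy_scores = [[0.9, 0.2], [0.8, 0.7], [0.3, 0.6], [0.1, 0.4]]
toy_labels = [[1, 0], [0, 1], [1, 1], [0, 0]]
print(round(mean_average_precision(toy_scores, toy_labels), 2))
```

In the toy example, the first class has one negative ranked above a positive, so its AP falls below 1, while the second class ranks both positives first and has an AP of 1.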

Related Papers

Privacy-Preserving Chest X-ray Classification in Latent Space with Homomorphically Encrypted Neural Inference (2025-06-18)
Explainable Detection of Implicit Influential Patterns in Conversations via Data Augmentation (2025-06-17)
AgriPotential: A Novel Multi-Spectral and Multi-Temporal Remote Sensing Dataset for Agricultural Potentials (2025-06-13)
MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification (2025-06-12)
ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese (2025-06-11)
Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis (2025-06-05)
PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches (2025-05-30)
Efficient Text Encoders for Labor Market Analysis (2025-05-30)