TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Attention-Driven Dynamic Graph Convolutional Network for M...

Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition

Jin Ye, Junjun He, Xiaojiang Peng, Wenhao Wu, Yu Qiao

2020-12-05ECCV 2020 8Multi-Label Classification
PaperPDFCode(official)

Abstract

Recent studies often exploit Graph Convolutional Network (GCN) to model label dependencies to improve recognition accuracy for multi-label image recognition. However, constructing a graph by counting the label co-occurrence possibilities of the training data may degrade model generalizability, especially when there exist occasional co-occurrence objects in test images. Our goal is to eliminate such bias and enhance the robustness of the learnt features. To this end, we propose an Attention-Driven Dynamic Graph Convolutional Network (ADD-GCN) to dynamically generate a specific graph for each image. ADD-GCN adopts a Dynamic Graph Convolutional Network (D-GCN) to model the relation of content-aware category representations that are generated by a Semantic Attention Module (SAM). Extensive experiments on public multi-label benchmarks demonstrate the effectiveness of our method, which achieves mAPs of 85.2%, 96.0%, and 95.5% on MS-COCO, VOC2007, and VOC2012, respectively, and outperforms current state-of-the-art methods with a clear margin. All codes can be found at https://github.com/Yejin0111/ADD-GCN.

Results

TaskDatasetMetricValueModel
Multi-Label ClassificationMS-COCOmAP85.2ADD-GCN

Related Papers

Privacy-Preserving Chest X-ray Classification in Latent Space with Homomorphically Encrypted Neural Inference2025-06-18Explainable Detection of Implicit Influential Patterns in Conversations via Data Augmentation2025-06-17AgriPotential: A Novel Multi-Spectral and Multi-Temporal Remote Sensing Dataset for Agricultural Potentials2025-06-13MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification2025-06-12ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese2025-06-11Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis2025-06-05PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches2025-05-30Efficient Text Encoders for Labor Market Analysis2025-05-30