TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multiple Anchor Learning for Visual Object Detection

Multiple Anchor Learning for Visual Object Detection

Wei Ke, Tianliang Zhang, Zeyi Huang, Qixiang Ye, Jianzhuang Liu, Dong Huang

2019-12-04CVPR 2020 6Multiple Instance LearningGeneral Classificationobject-detectionObject Detection
PaperPDFCodeCodeCode

Abstract

Classification and localization are two pillars of visual object detectors. However, in CNN-based detectors, these two modules are usually optimized under a fixed set of candidate (or anchor) bounding boxes. This configuration significantly limits the possibility to jointly optimize classification and localization. In this paper, we propose a Multiple Instance Learning (MIL) approach that selects anchors and jointly optimizes the two modules of a CNN-based object detector. Our approach, referred to as Multiple Anchor Learning (MAL), constructs anchor bags and selects the most representative anchors from each bag. Such an iterative selection process is potentially NP-hard to optimize. To address this issue, we solve MAL by repetitively depressing the confidence of selected anchors by perturbing their corresponding features. In an adversarial selection-depression manner, MAL not only pursues optimal solutions but also fully leverages multiple anchors/features to learn a detection model. Experiments show that MAL improves the baseline RetinaNet with significant margins on the commonly used MS-COCO object detection benchmark and achieves new state-of-the-art detection performance compared with recent methods.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devbox mAP47MAL (ResNeXt101, multi-scale)
Object DetectionCOCO test-devbox mAP45.9MAL (ResNeXt101, single-scale)
Object DetectionCOCO test-devbox mAP39.2MAL (ResNet50, single-scale)
3DCOCO test-devbox mAP47MAL (ResNeXt101, multi-scale)
3DCOCO test-devbox mAP45.9MAL (ResNeXt101, single-scale)
3DCOCO test-devbox mAP39.2MAL (ResNet50, single-scale)
2D ClassificationCOCO test-devbox mAP47MAL (ResNeXt101, multi-scale)
2D ClassificationCOCO test-devbox mAP45.9MAL (ResNeXt101, single-scale)
2D ClassificationCOCO test-devbox mAP39.2MAL (ResNet50, single-scale)
2D Object DetectionCOCO test-devbox mAP47MAL (ResNeXt101, multi-scale)
2D Object DetectionCOCO test-devbox mAP45.9MAL (ResNeXt101, single-scale)
2D Object DetectionCOCO test-devbox mAP39.2MAL (ResNet50, single-scale)
16kCOCO test-devbox mAP47MAL (ResNeXt101, multi-scale)
16kCOCO test-devbox mAP45.9MAL (ResNeXt101, single-scale)
16kCOCO test-devbox mAP39.2MAL (ResNet50, single-scale)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning2025-07-09ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08