Multiple Anchor Learning for Visual Object Detection

Wei Ke, Tianliang Zhang, Zeyi Huang, Qixiang Ye, Jianzhuang Liu, Dong Huang

2019-12-04CVPR 2020 6Multiple Instance Learning General Classification object-detection Object Detection

Abstract

Classification and localization are two pillars of visual object detectors. However, in CNN-based detectors, these two modules are usually optimized under a fixed set of candidate (or anchor) bounding boxes. This configuration significantly limits the possibility to jointly optimize classification and localization. In this paper, we propose a Multiple Instance Learning (MIL) approach that selects anchors and jointly optimizes the two modules of a CNN-based object detector. Our approach, referred to as Multiple Anchor Learning (MAL), constructs anchor bags and selects the most representative anchors from each bag. Such an iterative selection process is potentially NP-hard to optimize. To address this issue, we solve MAL by repetitively depressing the confidence of selected anchors by perturbing their corresponding features. In an adversarial selection-depression manner, MAL not only pursues optimal solutions but also fully leverages multiple anchors/features to learn a detection model. Experiments show that MAL improves the baseline RetinaNet with significant margins on the commonly used MS-COCO object detection benchmark and achieves new state-of-the-art detection performance compared with recent methods.

Results

Task	Dataset	Metric	Value	Model
Object Detection	COCO test-dev	box mAP	47	MAL (ResNeXt101, multi-scale)
Object Detection	COCO test-dev	box mAP	45.9	MAL (ResNeXt101, single-scale)
Object Detection	COCO test-dev	box mAP	39.2	MAL (ResNet50, single-scale)
3D	COCO test-dev	box mAP	47	MAL (ResNeXt101, multi-scale)
3D	COCO test-dev	box mAP	45.9	MAL (ResNeXt101, single-scale)
3D	COCO test-dev	box mAP	39.2	MAL (ResNet50, single-scale)
2D Classification	COCO test-dev	box mAP	47	MAL (ResNeXt101, multi-scale)
2D Classification	COCO test-dev	box mAP	45.9	MAL (ResNeXt101, single-scale)
2D Classification	COCO test-dev	box mAP	39.2	MAL (ResNet50, single-scale)
2D Object Detection	COCO test-dev	box mAP	47	MAL (ResNeXt101, multi-scale)
2D Object Detection	COCO test-dev	box mAP	45.9	MAL (ResNeXt101, single-scale)
2D Object Detection	COCO test-dev	box mAP	39.2	MAL (ResNet50, single-scale)
16k	COCO test-dev	box mAP	47	MAL (ResNeXt101, multi-scale)
16k	COCO test-dev	box mAP	45.9	MAL (ResNeXt101, single-scale)
16k	COCO test-dev	box mAP	39.2	MAL (ResNet50, single-scale)

Multiple Anchor Learning for Visual Object Detection

Abstract

Results

Related Papers

Multiple Anchor Learning for Visual Object Detection

Abstract

Results

Related Papers