TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LIP: Local Importance-based Pooling

LIP: Local Importance-based Pooling

Ziteng Gao, Li-Min Wang, Gangshan Wu

2019-08-12ICCV 2019 10Image ClassificationObject Detection
PaperPDFCode(official)

Abstract

Spatial downsampling layers are favored in convolutional neural networks (CNNs) to downscale feature maps for larger receptive fields and less memory consumption. However, for discriminative tasks, there is a possibility that these layers lose the discriminative details due to improper pooling strategies, which could hinder the learning process and eventually result in suboptimal models. In this paper, we present a unified framework over the existing downsampling layers (e.g., average pooling, max pooling, and strided convolution) from a local importance view. In this framework, we analyze the issues of these widely-used pooling layers and figure out the criteria for designing an effective downsampling layer. According to this analysis, we propose a conceptually simple, general, and effective pooling layer based on local importance modeling, termed as {\em Local Importance-based Pooling} (LIP). LIP can automatically enhance discriminative features during the downsampling procedure by learning adaptive importance weights based on inputs. Experiment results show that LIP consistently yields notable gains with different depths and different architectures on ImageNet classification. In the challenging MS COCO dataset, detectors with our LIP-ResNets as backbones obtain a consistent improvement ($\ge 1.4\%$) over the vanilla ResNets, and especially achieve the current state-of-the-art performance in detecting small objects under the single-scale testing scheme.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5065.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
Object DetectionCOCO test-devAP7548.1Faster R-CNN (LIP-ResNet-101-MD w FPN)
Object DetectionCOCO test-devAPL56.3Faster R-CNN (LIP-ResNet-101-MD w FPN)
Object DetectionCOCO test-devAPM46.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
Object DetectionCOCO test-devAPS25.4Faster R-CNN (LIP-ResNet-101-MD w FPN)
Object DetectionCOCO test-devbox mAP43.9Faster R-CNN (LIP-ResNet-101-MD w FPN)
Object DetectionCOCO minivalAP5063.6Faster R-CNN (LIP-ResNet-101)
Object DetectionCOCO minivalAP7545.6Faster R-CNN (LIP-ResNet-101)
Object DetectionCOCO minivalAPM45.8Faster R-CNN (LIP-ResNet-101)
Object DetectionCOCO minivalAPS25.2Faster R-CNN (LIP-ResNet-101)
Object DetectionCOCO minivalbox AP41.7Faster R-CNN (LIP-ResNet-101)
3DCOCO test-devAP5065.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
3DCOCO test-devAP7548.1Faster R-CNN (LIP-ResNet-101-MD w FPN)
3DCOCO test-devAPL56.3Faster R-CNN (LIP-ResNet-101-MD w FPN)
3DCOCO test-devAPM46.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
3DCOCO test-devAPS25.4Faster R-CNN (LIP-ResNet-101-MD w FPN)
3DCOCO test-devbox mAP43.9Faster R-CNN (LIP-ResNet-101-MD w FPN)
3DCOCO minivalAP5063.6Faster R-CNN (LIP-ResNet-101)
3DCOCO minivalAP7545.6Faster R-CNN (LIP-ResNet-101)
3DCOCO minivalAPM45.8Faster R-CNN (LIP-ResNet-101)
3DCOCO minivalAPS25.2Faster R-CNN (LIP-ResNet-101)
3DCOCO minivalbox AP41.7Faster R-CNN (LIP-ResNet-101)
2D ClassificationCOCO test-devAP5065.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D ClassificationCOCO test-devAP7548.1Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D ClassificationCOCO test-devAPL56.3Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D ClassificationCOCO test-devAPM46.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D ClassificationCOCO test-devAPS25.4Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D ClassificationCOCO test-devbox mAP43.9Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D ClassificationCOCO minivalAP5063.6Faster R-CNN (LIP-ResNet-101)
2D ClassificationCOCO minivalAP7545.6Faster R-CNN (LIP-ResNet-101)
2D ClassificationCOCO minivalAPM45.8Faster R-CNN (LIP-ResNet-101)
2D ClassificationCOCO minivalAPS25.2Faster R-CNN (LIP-ResNet-101)
2D ClassificationCOCO minivalbox AP41.7Faster R-CNN (LIP-ResNet-101)
2D Object DetectionCOCO test-devAP5065.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D Object DetectionCOCO test-devAP7548.1Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D Object DetectionCOCO test-devAPL56.3Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D Object DetectionCOCO test-devAPM46.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D Object DetectionCOCO test-devAPS25.4Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D Object DetectionCOCO test-devbox mAP43.9Faster R-CNN (LIP-ResNet-101-MD w FPN)
2D Object DetectionCOCO minivalAP5063.6Faster R-CNN (LIP-ResNet-101)
2D Object DetectionCOCO minivalAP7545.6Faster R-CNN (LIP-ResNet-101)
2D Object DetectionCOCO minivalAPM45.8Faster R-CNN (LIP-ResNet-101)
2D Object DetectionCOCO minivalAPS25.2Faster R-CNN (LIP-ResNet-101)
2D Object DetectionCOCO minivalbox AP41.7Faster R-CNN (LIP-ResNet-101)
16kCOCO test-devAP5065.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
16kCOCO test-devAP7548.1Faster R-CNN (LIP-ResNet-101-MD w FPN)
16kCOCO test-devAPL56.3Faster R-CNN (LIP-ResNet-101-MD w FPN)
16kCOCO test-devAPM46.7Faster R-CNN (LIP-ResNet-101-MD w FPN)
16kCOCO test-devAPS25.4Faster R-CNN (LIP-ResNet-101-MD w FPN)
16kCOCO test-devbox mAP43.9Faster R-CNN (LIP-ResNet-101-MD w FPN)
16kCOCO minivalAP5063.6Faster R-CNN (LIP-ResNet-101)
16kCOCO minivalAP7545.6Faster R-CNN (LIP-ResNet-101)
16kCOCO minivalAPM45.8Faster R-CNN (LIP-ResNet-101)
16kCOCO minivalAPS25.2Faster R-CNN (LIP-ResNet-101)
16kCOCO minivalbox AP41.7Faster R-CNN (LIP-ResNet-101)

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17