TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A novel Region of Interest Extraction Layer for Instance S...

A novel Region of Interest Extraction Layer for Instance Segmentation

Leonardo Rossi, Akbar Karimi, Andrea Prati

2020-04-28SegmentationSemantic SegmentationInstance Segmentationobject-detectionObject Detection
PaperPDFCodeCodeCodeCodeCode(official)

Abstract

Given the wide diffusion of deep neural network architectures for computer vision tasks, several new applications are nowadays more and more feasible. Among them, a particular attention has been recently given to instance segmentation, by exploiting the results achievable by two-stage networks (such as Mask R-CNN or Faster R-CNN), derived from R-CNN. In these complex architectures, a crucial role is played by the Region of Interest (RoI) extraction layer, devoted to extracting a coherent subset of features from a single Feature Pyramid Network (FPN) layer attached on top of a backbone. This paper is motivated by the need to overcome the limitations of existing RoI extractors which select only one (the best) layer from FPN. Our intuition is that all the layers of FPN retain useful information. Therefore, the proposed layer (called Generic RoI Extractor - GRoIE) introduces non-local building blocks and attention mechanisms to boost the performance. A comprehensive ablation study at component level is conducted to find the best set of algorithms and parameters for the GRoIE layer. Moreover, GRoIE can be integrated seamlessly with every two-stage architecture for both object detection and instance segmentation tasks. Therefore, the improvements brought about by the use of GRoIE in different state-of-the-art architectures are also evaluated. The proposed layer leads up to gain a 1.1% AP improvement on bounding box detection and 1.7% AP improvement on instance segmentation. The code is publicly available on GitHub repository at https://github.com/IMPLabUniPr/mmdetection/tree/groie_dev

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5059.9Mask R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAP7541.7Mask R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAPL49.7Mask R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAPM42.1Mask R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAPS22.9Mask R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalbox AP38.4Mask R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAP5059.2Faster R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAP7540.6Faster R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAPL47.8Faster R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAPM41.5Faster R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalAPS22.3Faster R-CNN (ResNet-50-FPN, GRoIE)
Object DetectionCOCO minivalbox AP37.5Faster R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAP5059.9Mask R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAP7541.7Mask R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAPL49.7Mask R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAPM42.1Mask R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAPS22.9Mask R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalbox AP38.4Mask R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAP5059.2Faster R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAP7540.6Faster R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAPL47.8Faster R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAPM41.5Faster R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalAPS22.3Faster R-CNN (ResNet-50-FPN, GRoIE)
3DCOCO minivalbox AP37.5Faster R-CNN (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAP5059.3GCnet (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAP7539.8GCnet (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAPL51.2GCnet (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAPM41GCnet (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAPS20.2GCnet (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalmask AP37.2GCnet (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAP5057.1Mask R-CNN (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAP7538Mask R-CNN (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAPL48.7Mask R-CNN (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAPM39Mask R-CNN (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalAPS19.1Mask R-CNN (ResNet-50-FPN, GRoIE)
Instance SegmentationCOCO minivalmask AP35.8Mask R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAP5059.9Mask R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAP7541.7Mask R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAPL49.7Mask R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAPM42.1Mask R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAPS22.9Mask R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalbox AP38.4Mask R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAP5059.2Faster R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAP7540.6Faster R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAPL47.8Faster R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAPM41.5Faster R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalAPS22.3Faster R-CNN (ResNet-50-FPN, GRoIE)
2D ClassificationCOCO minivalbox AP37.5Faster R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAP5059.9Mask R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAP7541.7Mask R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAPL49.7Mask R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAPM42.1Mask R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAPS22.9Mask R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalbox AP38.4Mask R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAP5059.2Faster R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAP7540.6Faster R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAPL47.8Faster R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAPM41.5Faster R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalAPS22.3Faster R-CNN (ResNet-50-FPN, GRoIE)
2D Object DetectionCOCO minivalbox AP37.5Faster R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAP5059.9Mask R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAP7541.7Mask R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAPL49.7Mask R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAPM42.1Mask R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAPS22.9Mask R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalbox AP38.4Mask R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAP5059.2Faster R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAP7540.6Faster R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAPL47.8Faster R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAPM41.5Faster R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalAPS22.3Faster R-CNN (ResNet-50-FPN, GRoIE)
16kCOCO minivalbox AP37.5Faster R-CNN (ResNet-50-FPN, GRoIE)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17