TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Tri-Layer Plugin to Improve Occluded Detection

A Tri-Layer Plugin to Improve Occluded Detection

Guanqi Zhan, Weidi Xie, Andrew Zisserman

2022-10-18Semantic SegmentationInstance Segmentationobject-detectionObject Detection
PaperPDFCode(official)

Abstract

Detecting occluded objects still remains a challenge for state-of-the-art object detectors. The objective of this work is to improve the detection for such objects, and thereby improve the overall performance of a modern object detector. To this end we make the following four contributions: (1) We propose a simple 'plugin' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects. The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object. (2) We propose a scalable pipeline for generating training data for the module by using amodal completion of existing object detection and instance segmentation training datasets to establish occlusion relationships. (3) We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects. (4) We show that the plugin module inserted into a two-stage detector can boost the performance significantly, by only fine-tuning the detection head, and with additional improvements if the entire architecture is fine-tuned. COCO results are reported for Mask R-CNN with Swin-T or Swin-S backbones, and Cascade Mask R-CNN with a Swin-B backbone.

Results

TaskDatasetMetricValueModel
Instance SegmentationOccluded COCOMean Recall63.64Swin-B + Cascade Mask R-CNN (tri-layer modelling)
Instance SegmentationOccluded COCOMean Recall62.58Swin-S + Mask R-CNN (tri-layer plugin)
Instance SegmentationOccluded COCOMean Recall62Swin-T + Mask R-CNN (tri-layer plugin)
Instance SegmentationSeparated COCOMean Recall36.88Swin-B + Cascade Mask R-CNN (tri-layer modelling)
Instance SegmentationSeparated COCOMean Recall35.8Swin-S + Mask R-CNN (tri-layer plugin)
Instance SegmentationSeparated COCOMean Recall34.72Swin-T + Mask R-CNN (tri-layer plugin)
Instance SegmentationCOCO test-devmask AP45.9Swin-B + Cascade Mask R-CNN (tri-layer modelling)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17