TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BlendMask: Top-Down Meets Bottom-Up for Instance Segmentat...

BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation

Hao Chen, Kunyang Sun, Zhi Tian, Chunhua Shen, Yongming Huang, Youliang Yan

2020-01-02CVPR 2020 6Real-time Instance SegmentationSegmentationSemantic SegmentationInstance Segmentation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Instance segmentation is one of the fundamental vision tasks. Recently, fully convolutional instance segmentation methods have drawn much attention as they are often simpler and more efficient than two-stage approaches like Mask R-CNN. To date, almost all such approaches fall behind the two-stage Mask R-CNN method in mask precision when models have similar computation complexity, leaving great room for improvement. In this work, we achieve improved mask prediction by effectively combining instance-level information with semantic information with lower-level fine-granularity. Our main contribution is a blender module which draws inspiration from both top-down and bottom-up instance segmentation approaches. The proposed BlendMask can effectively predict dense per-pixel position-sensitive instance features with very few channels, and learn attention maps for each instance with merely one convolution layer, thus being fast in inference. BlendMask can be easily incorporated with the state-of-the-art one-stage detection frameworks and outperforms Mask R-CNN under the same training schedule while being 20% faster. A light-weight version of BlendMask achieves $ 34.2% $ mAP at 25 FPS evaluated on a single 1080Ti GPU card. Because of its simplicity and efficacy, we hope that our BlendMask could serve as a simple yet strong baseline for a wide range of instance-wise prediction tasks. Code is available at https://git.io/AdelaiDet

Results

TaskDatasetMetricValueModel
Instance SegmentationCOCO test-devAP5063.1BlendMask (ResNet-101 + DCN interval=3)
Instance SegmentationCOCO test-devAP7544.6BlendMask (ResNet-101 + DCN interval=3)
Instance SegmentationCOCO test-devAPL54.5BlendMask (ResNet-101 + DCN interval=3)
Instance SegmentationCOCO test-devAPM44.1BlendMask (ResNet-101 + DCN interval=3)
Instance SegmentationCOCO test-devAPS22.7BlendMask (ResNet-101 + DCN interval=3)
Instance SegmentationCOCO test-devmask AP41.3BlendMask (ResNet-101 + DCN interval=3)
Instance SegmentationMSCOCOFrame (fps)33.3BlendMask-512 (DLA_34)
Instance SegmentationMSCOCOmask AP35.2BlendMask-512 (DLA_34)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17