TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hybrid Task Cascade for Instance Segmentation

Hybrid Task Cascade for Instance Segmentation

Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin

2019-01-22CVPR 2019 6SegmentationOpen-Ended Question AnsweringSemantic SegmentationInstance Segmentationobject-detectionObject Detection
PaperPDFCodeCode(official)CodeCodeCode

Abstract

Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question. A simple combination of Cascade R-CNN and Mask R-CNN only brings limited gain. In exploring a more effective approach, we find that the key to a successful instance segmentation cascade is to fully leverage the reciprocal relationship between detection and segmentation. In this work, we propose a new framework, Hybrid Task Cascade (HTC), which differs in two important aspects: (1) instead of performing cascaded refinement on these two tasks separately, it interweaves them for a joint multi-stage processing; (2) it adopts a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background. Overall, this framework can learn more discriminative features progressively while integrating complementary features together in each stage. Without bells and whistles, a single HTC obtains 38.4 and 1.5 improvement over a strong Cascade Mask R-CNN baseline on MSCOCO dataset. Moreover, our overall system achieves 48.6 mask AP on the test-challenge split, ranking 1st in the COCO 2018 Challenge Object Detection Task. Code is available at: https://github.com/open-mmlab/mmdetection.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5063.9HTC (ResNeXt-101-FPN)
Object DetectionCOCO test-devAP7544.7HTC (ResNeXt-101-FPN)
Object DetectionCOCO test-devAPL54.6HTC (ResNeXt-101-FPN)
Object DetectionCOCO test-devAPM43.9HTC (ResNeXt-101-FPN)
Object DetectionCOCO test-devAPS22.8HTC (ResNeXt-101-FPN)
Object DetectionCOCO test-devbox mAP47.1HTC (ResNeXt-101-FPN)
Object DetectionCOCO-OAverage mAP19.1HTC (ResNet-50)
Object DetectionCOCO-OEffective Robustness0.08HTC (ResNet-50)
Object DetectionCOCO minivalAP5059.4HTC (cascade)
Object DetectionCOCO minivalAP7540.7HTC (cascade)
Object DetectionCOCO minivalAPL52.3HTC (cascade)
Object DetectionCOCO minivalAPM40.9HTC (cascade)
Object DetectionCOCO minivalAPS20.3HTC (cascade)
Object DetectionCOCO minivalbox AP43.2HTC (cascade)
3DCOCO test-devAP5063.9HTC (ResNeXt-101-FPN)
3DCOCO test-devAP7544.7HTC (ResNeXt-101-FPN)
3DCOCO test-devAPL54.6HTC (ResNeXt-101-FPN)
3DCOCO test-devAPM43.9HTC (ResNeXt-101-FPN)
3DCOCO test-devAPS22.8HTC (ResNeXt-101-FPN)
3DCOCO test-devbox mAP47.1HTC (ResNeXt-101-FPN)
3DCOCO-OAverage mAP19.1HTC (ResNet-50)
3DCOCO-OEffective Robustness0.08HTC (ResNet-50)
3DCOCO minivalAP5059.4HTC (cascade)
3DCOCO minivalAP7540.7HTC (cascade)
3DCOCO minivalAPL52.3HTC (cascade)
3DCOCO minivalAPM40.9HTC (cascade)
3DCOCO minivalAPS20.3HTC (cascade)
3DCOCO minivalbox AP43.2HTC (cascade)
Instance SegmentationCOCO minivalmask AP38.2HTC (ResNet-50)
Instance SegmentationCOCO test-devmask AP41.2HTC + ResNeXt-101-FPN + DCN
2D ClassificationCOCO test-devAP5063.9HTC (ResNeXt-101-FPN)
2D ClassificationCOCO test-devAP7544.7HTC (ResNeXt-101-FPN)
2D ClassificationCOCO test-devAPL54.6HTC (ResNeXt-101-FPN)
2D ClassificationCOCO test-devAPM43.9HTC (ResNeXt-101-FPN)
2D ClassificationCOCO test-devAPS22.8HTC (ResNeXt-101-FPN)
2D ClassificationCOCO test-devbox mAP47.1HTC (ResNeXt-101-FPN)
2D ClassificationCOCO-OAverage mAP19.1HTC (ResNet-50)
2D ClassificationCOCO-OEffective Robustness0.08HTC (ResNet-50)
2D ClassificationCOCO minivalAP5059.4HTC (cascade)
2D ClassificationCOCO minivalAP7540.7HTC (cascade)
2D ClassificationCOCO minivalAPL52.3HTC (cascade)
2D ClassificationCOCO minivalAPM40.9HTC (cascade)
2D ClassificationCOCO minivalAPS20.3HTC (cascade)
2D ClassificationCOCO minivalbox AP43.2HTC (cascade)
2D Object DetectionCOCO test-devAP5063.9HTC (ResNeXt-101-FPN)
2D Object DetectionCOCO test-devAP7544.7HTC (ResNeXt-101-FPN)
2D Object DetectionCOCO test-devAPL54.6HTC (ResNeXt-101-FPN)
2D Object DetectionCOCO test-devAPM43.9HTC (ResNeXt-101-FPN)
2D Object DetectionCOCO test-devAPS22.8HTC (ResNeXt-101-FPN)
2D Object DetectionCOCO test-devbox mAP47.1HTC (ResNeXt-101-FPN)
2D Object DetectionCOCO-OAverage mAP19.1HTC (ResNet-50)
2D Object DetectionCOCO-OEffective Robustness0.08HTC (ResNet-50)
2D Object DetectionCOCO minivalAP5059.4HTC (cascade)
2D Object DetectionCOCO minivalAP7540.7HTC (cascade)
2D Object DetectionCOCO minivalAPL52.3HTC (cascade)
2D Object DetectionCOCO minivalAPM40.9HTC (cascade)
2D Object DetectionCOCO minivalAPS20.3HTC (cascade)
2D Object DetectionCOCO minivalbox AP43.2HTC (cascade)
16kCOCO test-devAP5063.9HTC (ResNeXt-101-FPN)
16kCOCO test-devAP7544.7HTC (ResNeXt-101-FPN)
16kCOCO test-devAPL54.6HTC (ResNeXt-101-FPN)
16kCOCO test-devAPM43.9HTC (ResNeXt-101-FPN)
16kCOCO test-devAPS22.8HTC (ResNeXt-101-FPN)
16kCOCO test-devbox mAP47.1HTC (ResNeXt-101-FPN)
16kCOCO-OAverage mAP19.1HTC (ResNet-50)
16kCOCO-OEffective Robustness0.08HTC (ResNet-50)
16kCOCO minivalAP5059.4HTC (cascade)
16kCOCO minivalAP7540.7HTC (cascade)
16kCOCO minivalAPL52.3HTC (cascade)
16kCOCO minivalAPM40.9HTC (cascade)
16kCOCO minivalAPS20.3HTC (cascade)
16kCOCO minivalbox AP43.2HTC (cascade)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17