TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SOLQ: Segmenting Objects by Learning Queries

SOLQ: Segmenting Objects by Learning Queries

Bin Dong, Fangao Zeng, Tiancai Wang, Xiangyu Zhang, Yichen Wei

2021-06-04NeurIPS 2021 12SegmentationSemantic SegmentationInstance SegmentationObject Detection
PaperPDFCode(official)

Abstract

In this paper, we propose an end-to-end framework for instance segmentation. Based on the recently introduced DETR [1], our method, termed SOLQ, segments objects by learning unified queries. In SOLQ, each query represents one object and has multiple representations: class, location and mask. The object queries learned perform classification, box regression and mask encoding simultaneously in an unified vector form. During training phase, the mask vectors encoded are supervised by the compression coding of raw spatial masks. In inference time, mask vectors produced can be directly transformed to spatial masks by the inverse process of compression coding. Experimental results show that SOLQ can achieve state-of-the-art performance, surpassing most of existing approaches. Moreover, the joint learning of unified query representation can greatly improve the detection performance of DETR. We hope our SOLQ can serve as a strong baseline for the Transformer-based instance segmentation. Code is available at https://github.com/megvii-research/SOLQ.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5074.6SOLQ (Swin-L, single scale)
Object DetectionCOCO test-devAP7560.5SOLQ (Swin-L, single scale)
Object DetectionCOCO test-devAPL70.6SOLQ (Swin-L, single scale)
Object DetectionCOCO test-devAPM60SOLQ (Swin-L, single scale)
Object DetectionCOCO test-devAPS37.6SOLQ (Swin-L, single scale)
Object DetectionCOCO test-devbox mAP56.5SOLQ (Swin-L, single scale)
Object DetectionCOCO test-devbox mAP48.7SOLQ (ResNet101, single scale)
Object DetectionCOCO test-devbox mAP47.8SOLQ (ResNet50, single scale)
Object DetectionCOCO minivalAP5074.9SOLQ (Swin-L, single scale)
Object DetectionCOCO minivalAP7561.3SOLQ (Swin-L, single scale)
Object DetectionCOCO minivalAPL71.9SOLQ (Swin-L, single scale)
3DCOCO test-devAP5074.6SOLQ (Swin-L, single scale)
3DCOCO test-devAP7560.5SOLQ (Swin-L, single scale)
3DCOCO test-devAPL70.6SOLQ (Swin-L, single scale)
3DCOCO test-devAPM60SOLQ (Swin-L, single scale)
3DCOCO test-devAPS37.6SOLQ (Swin-L, single scale)
3DCOCO test-devbox mAP56.5SOLQ (Swin-L, single scale)
3DCOCO test-devbox mAP48.7SOLQ (ResNet101, single scale)
3DCOCO test-devbox mAP47.8SOLQ (ResNet50, single scale)
3DCOCO minivalAP5074.9SOLQ (Swin-L, single scale)
3DCOCO minivalAP7561.3SOLQ (Swin-L, single scale)
3DCOCO minivalAPL71.9SOLQ (Swin-L, single scale)
Instance SegmentationCOCO test-devmask AP46.7SOLQ (Swin-L, single scale)
Instance SegmentationCOCO test-devmask AP40.9SOLQ (ResNet101, single scale)
Instance SegmentationCOCO test-devmask AP39.7SOLQ (ResNet50, single scale)
2D ClassificationCOCO test-devAP5074.6SOLQ (Swin-L, single scale)
2D ClassificationCOCO test-devAP7560.5SOLQ (Swin-L, single scale)
2D ClassificationCOCO test-devAPL70.6SOLQ (Swin-L, single scale)
2D ClassificationCOCO test-devAPM60SOLQ (Swin-L, single scale)
2D ClassificationCOCO test-devAPS37.6SOLQ (Swin-L, single scale)
2D ClassificationCOCO test-devbox mAP56.5SOLQ (Swin-L, single scale)
2D ClassificationCOCO test-devbox mAP48.7SOLQ (ResNet101, single scale)
2D ClassificationCOCO test-devbox mAP47.8SOLQ (ResNet50, single scale)
2D ClassificationCOCO minivalAP5074.9SOLQ (Swin-L, single scale)
2D ClassificationCOCO minivalAP7561.3SOLQ (Swin-L, single scale)
2D ClassificationCOCO minivalAPL71.9SOLQ (Swin-L, single scale)
2D Object DetectionCOCO test-devAP5074.6SOLQ (Swin-L, single scale)
2D Object DetectionCOCO test-devAP7560.5SOLQ (Swin-L, single scale)
2D Object DetectionCOCO test-devAPL70.6SOLQ (Swin-L, single scale)
2D Object DetectionCOCO test-devAPM60SOLQ (Swin-L, single scale)
2D Object DetectionCOCO test-devAPS37.6SOLQ (Swin-L, single scale)
2D Object DetectionCOCO test-devbox mAP56.5SOLQ (Swin-L, single scale)
2D Object DetectionCOCO test-devbox mAP48.7SOLQ (ResNet101, single scale)
2D Object DetectionCOCO test-devbox mAP47.8SOLQ (ResNet50, single scale)
2D Object DetectionCOCO minivalAP5074.9SOLQ (Swin-L, single scale)
2D Object DetectionCOCO minivalAP7561.3SOLQ (Swin-L, single scale)
2D Object DetectionCOCO minivalAPL71.9SOLQ (Swin-L, single scale)
16kCOCO test-devAP5074.6SOLQ (Swin-L, single scale)
16kCOCO test-devAP7560.5SOLQ (Swin-L, single scale)
16kCOCO test-devAPL70.6SOLQ (Swin-L, single scale)
16kCOCO test-devAPM60SOLQ (Swin-L, single scale)
16kCOCO test-devAPS37.6SOLQ (Swin-L, single scale)
16kCOCO test-devbox mAP56.5SOLQ (Swin-L, single scale)
16kCOCO test-devbox mAP48.7SOLQ (ResNet101, single scale)
16kCOCO test-devbox mAP47.8SOLQ (ResNet50, single scale)
16kCOCO minivalAP5074.9SOLQ (Swin-L, single scale)
16kCOCO minivalAP7561.3SOLQ (Swin-L, single scale)
16kCOCO minivalAPL71.9SOLQ (Swin-L, single scale)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17