TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/End-to-End Semi-Supervised Object Detection with Soft Teac...

End-to-End Semi-Supervised Object Detection with Soft Teacher

Mengde Xu, Zheng Zhang, Han Hu, JianFeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu

2021-06-16ICCV 2021 10Semantic SegmentationInstance Segmentationobject-detectionObject DetectionSemi-Supervised Object Detection
PaperPDFCode(official)CodeCodeCodeCodeCodeCodeCode

Abstract

This paper presents an end-to-end semi-supervised object detection approach, in contrast to previous more complex multi-stage methods. The end-to-end training gradually improves pseudo label qualities during the curriculum, and the more and more accurate pseudo labels in turn benefit object detection training. We also propose two simple yet effective techniques within this framework: a soft teacher mechanism where the classification loss of each unlabeled bounding box is weighed by the classification score produced by the teacher network; a box jittering approach to select reliable pseudo boxes for the learning of box regression. On the COCO benchmark, the proposed approach outperforms previous methods by a large margin under various labeling ratios, i.e. 1\%, 5\% and 10\%. Moreover, our approach proves to perform also well when the amount of labeled data is relatively large. For example, it can improve a 40.9 mAP baseline detector trained using the full COCO training set by +3.6 mAP, reaching 44.5 mAP, by leveraging the 123K unlabeled images of COCO. On the state-of-the-art Swin Transformer based object detector (58.9 mAP on test-dev), it can still significantly improve the detection accuracy by +1.5 mAP, reaching 60.4 mAP, and improve the instance segmentation accuracy by +1.2 mAP, reaching 52.4 mAP. Further incorporating with the Object365 pre-trained model, the detection accuracy reaches 61.3 mAP and the instance segmentation accuracy reaches 53.0 mAP, pushing the new state-of-the-art.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devbox mAP61.3Soft Teacher + Swin-L (HTC++, multi-scale)
Object DetectionCOCO minivalbox AP60.7Soft Teacher + Swin-L (HTC++, multi-scale)
Object DetectionCOCO minivalbox AP60.1Soft Teacher+Swin-L(HTC++, single scale)
3DCOCO test-devbox mAP61.3Soft Teacher + Swin-L (HTC++, multi-scale)
3DCOCO minivalbox AP60.7Soft Teacher + Swin-L (HTC++, multi-scale)
3DCOCO minivalbox AP60.1Soft Teacher+Swin-L(HTC++, single scale)
Instance SegmentationCOCO minivalmask AP52.5Soft Teacher + Swin-L(HTC++, multi-scale)
Instance SegmentationCOCO minivalmask AP51.9Soft Teacher + Swin-L(HTC++, single-scale)
Instance SegmentationCOCO test-devmask AP53Soft Teacher + Swin-L (HTC++, multi-scale)
Semi-Supervised Object DetectionCOCO 100% labeled datamAP44.9Soft Teacher
Semi-Supervised Object DetectionCOCO 10% labeled datamAP34.04Soft Teacher
Semi-Supervised Object DetectionCOCO 5% labeled datamAP30.74Soft Teacher + Swin-L(HTC++, multi-scale)
Semi-Supervised Object DetectionCOCO 1% labeled datamAP20.46Soft Teacher + Swin-L(HTC++, multi-scale)
2D ClassificationCOCO test-devbox mAP61.3Soft Teacher + Swin-L (HTC++, multi-scale)
2D ClassificationCOCO minivalbox AP60.7Soft Teacher + Swin-L (HTC++, multi-scale)
2D ClassificationCOCO minivalbox AP60.1Soft Teacher+Swin-L(HTC++, single scale)
2D Object DetectionCOCO test-devbox mAP61.3Soft Teacher + Swin-L (HTC++, multi-scale)
2D Object DetectionCOCO minivalbox AP60.7Soft Teacher + Swin-L (HTC++, multi-scale)
2D Object DetectionCOCO minivalbox AP60.1Soft Teacher+Swin-L(HTC++, single scale)
2D Object DetectionCOCO 100% labeled datamAP44.9Soft Teacher
2D Object DetectionCOCO 10% labeled datamAP34.04Soft Teacher
2D Object DetectionCOCO 5% labeled datamAP30.74Soft Teacher + Swin-L(HTC++, multi-scale)
2D Object DetectionCOCO 1% labeled datamAP20.46Soft Teacher + Swin-L(HTC++, multi-scale)
16kCOCO test-devbox mAP61.3Soft Teacher + Swin-L (HTC++, multi-scale)
16kCOCO minivalbox AP60.7Soft Teacher + Swin-L (HTC++, multi-scale)
16kCOCO minivalbox AP60.1Soft Teacher+Swin-L(HTC++, single scale)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17