End-to-End Semi-Supervised Object Detection with Soft Teacher

Mengde Xu, Zheng Zhang, Han Hu, JianFeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu

2021-06-16ICCV 2021 10Semantic Segmentation Instance Segmentation object-detection Object Detection Semi-Supervised Object Detection

Paper PDF Code(official)Code Code Code Code Code Code Code

Abstract

This paper presents an end-to-end semi-supervised object detection approach, in contrast to previous more complex multi-stage methods. The end-to-end training gradually improves pseudo label qualities during the curriculum, and the more and more accurate pseudo labels in turn benefit object detection training. We also propose two simple yet effective techniques within this framework: a soft teacher mechanism where the classification loss of each unlabeled bounding box is weighed by the classification score produced by the teacher network; a box jittering approach to select reliable pseudo boxes for the learning of box regression. On the COCO benchmark, the proposed approach outperforms previous methods by a large margin under various labeling ratios, i.e. 1\%, 5\% and 10\%. Moreover, our approach proves to perform also well when the amount of labeled data is relatively large. For example, it can improve a 40.9 mAP baseline detector trained using the full COCO training set by +3.6 mAP, reaching 44.5 mAP, by leveraging the 123K unlabeled images of COCO. On the state-of-the-art Swin Transformer based object detector (58.9 mAP on test-dev), it can still significantly improve the detection accuracy by +1.5 mAP, reaching 60.4 mAP, and improve the instance segmentation accuracy by +1.2 mAP, reaching 52.4 mAP. Further incorporating with the Object365 pre-trained model, the detection accuracy reaches 61.3 mAP and the instance segmentation accuracy reaches 53.0 mAP, pushing the new state-of-the-art.

Results

Task	Dataset	Metric	Value	Model
Object Detection	COCO test-dev	box mAP	61.3	Soft Teacher + Swin-L (HTC++, multi-scale)
Object Detection	COCO minival	box AP	60.7	Soft Teacher + Swin-L (HTC++, multi-scale)
Object Detection	COCO minival	box AP	60.1	Soft Teacher+Swin-L(HTC++, single scale)
3D	COCO test-dev	box mAP	61.3	Soft Teacher + Swin-L (HTC++, multi-scale)
3D	COCO minival	box AP	60.7	Soft Teacher + Swin-L (HTC++, multi-scale)
3D	COCO minival	box AP	60.1	Soft Teacher+Swin-L(HTC++, single scale)
Instance Segmentation	COCO minival	mask AP	52.5	Soft Teacher + Swin-L(HTC++, multi-scale)
Instance Segmentation	COCO minival	mask AP	51.9	Soft Teacher + Swin-L(HTC++, single-scale)
Instance Segmentation	COCO test-dev	mask AP	53	Soft Teacher + Swin-L (HTC++, multi-scale)
Semi-Supervised Object Detection	COCO 100% labeled data	mAP	44.9	Soft Teacher
Semi-Supervised Object Detection	COCO 10% labeled data	mAP	34.04	Soft Teacher
Semi-Supervised Object Detection	COCO 5% labeled data	mAP	30.74	Soft Teacher + Swin-L(HTC++, multi-scale)
Semi-Supervised Object Detection	COCO 1% labeled data	mAP	20.46	Soft Teacher + Swin-L(HTC++, multi-scale)
2D Classification	COCO test-dev	box mAP	61.3	Soft Teacher + Swin-L (HTC++, multi-scale)
2D Classification	COCO minival	box AP	60.7	Soft Teacher + Swin-L (HTC++, multi-scale)
2D Classification	COCO minival	box AP	60.1	Soft Teacher+Swin-L(HTC++, single scale)
2D Object Detection	COCO test-dev	box mAP	61.3	Soft Teacher + Swin-L (HTC++, multi-scale)
2D Object Detection	COCO minival	box AP	60.7	Soft Teacher + Swin-L (HTC++, multi-scale)
2D Object Detection	COCO minival	box AP	60.1	Soft Teacher+Swin-L(HTC++, single scale)
2D Object Detection	COCO 100% labeled data	mAP	44.9	Soft Teacher
2D Object Detection	COCO 10% labeled data	mAP	34.04	Soft Teacher
2D Object Detection	COCO 5% labeled data	mAP	30.74	Soft Teacher + Swin-L(HTC++, multi-scale)
2D Object Detection	COCO 1% labeled data	mAP	20.46	Soft Teacher + Swin-L(HTC++, multi-scale)
16k	COCO test-dev	box mAP	61.3	Soft Teacher + Swin-L (HTC++, multi-scale)
16k	COCO minival	box AP	60.7	Soft Teacher + Swin-L (HTC++, multi-scale)
16k	COCO minival	box AP	60.1	Soft Teacher+Swin-L(HTC++, single scale)

End-to-End Semi-Supervised Object Detection with Soft Teacher

Abstract

Results

Related Papers

End-to-End Semi-Supervised Object Detection with Soft Teacher

Abstract

Results

Related Papers