TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/RTMDet: An Empirical Study of Designing Real-Time Object D...

RTMDet: An Empirical Study of Designing Real-Time Object Detectors

Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi Liu, Shilong Zhang, Kai Chen

2022-12-14Real-time Instance SegmentationObject Detection In Aerial ImagesObject RecognitionReal-Time Object DetectionSemantic SegmentationOne-stage Anchor-free Oriented Object DetectionInstance SegmentationOriented Object Detectionobject-detectionObject Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCode

Abstract

In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection. To obtain a more efficient model architecture, we explore an architecture that has compatible capacities in the backbone and neck, constructed by a basic building block that consists of large-kernel depth-wise convolutions. We further introduce soft labels when calculating matching costs in the dynamic label assignment to improve accuracy. Together with better training techniques, the resulting object detector, named RTMDet, achieves 52.8% AP on COCO with 300+ FPS on an NVIDIA 3090 GPU, outperforming the current mainstream industrial detectors. RTMDet achieves the best parameter-accuracy trade-off with tiny/small/medium/large/extra-large model sizes for various application scenarios, and obtains new state-of-the-art performance on real-time instance segmentation and rotated object detection. We hope the experimental results can provide new insights into designing versatile real-time object detectors for many object recognition tasks. Code and models are released at https://github.com/open-mmlab/mmdetection/tree/3.x/configs/rtmdet.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO (Common Objects in Context)box AP52.8RTMDet
Object DetectionHRSC2016mAP-0790.6RTMDet-R-tiny
Object DetectionHRSC2016mAP-1297.1RTMDet-R-tiny
3DCOCO (Common Objects in Context)box AP52.8RTMDet
3DHRSC2016mAP-0790.6RTMDet-R-tiny
3DHRSC2016mAP-1297.1RTMDet-R-tiny
Instance SegmentationMSCOCO-1kAPM49RTMDet-Ins-x
Instance Segmentationmulti30k_test_2017_mscocomask AP38.7RTMDet-Ins-s
Instance SegmentationMSCOCOAP5067.4RTMDet-Ins-x
Instance SegmentationMSCOCOAP7547.8RTMDet-Ins-x
Instance SegmentationMSCOCOAPL65.5RTMDet-Ins-x
Instance SegmentationMSCOCOAPS22.2RTMDet-Ins-x
Instance SegmentationMSCOCOmask AP44.6RTMDet-Ins-x
Instance SegmentationMSCOCOAP5066RTMDet-Ins-l
Instance SegmentationMSCOCOAP7547RTMDet-Ins-l
Instance SegmentationMSCOCOAPL64.8RTMDet-Ins-l
Instance SegmentationMSCOCOAPM48RTMDet-Ins-l
Instance SegmentationMSCOCOAPS20.8RTMDet-Ins-l
Instance SegmentationMSCOCOmask AP43.7RTMDet-Ins-l
Instance SegmentationMSCOCOAP5063.9RTMDet-Ins-m
Instance SegmentationMSCOCOAP7545.1RTMDet-Ins-m
Instance SegmentationMSCOCOAPL63.1RTMDet-Ins-m
Instance SegmentationMSCOCOAPM46.4RTMDet-Ins-m
Instance SegmentationMSCOCOAPS19.3RTMDet-Ins-m
Instance SegmentationMSCOCOmask AP42.1RTMDet-Ins-m
Instance SegmentationMSCOCOAP5059.3RTMDet-Ins-s
Instance SegmentationMSCOCOAP7541.3RTMDet-Ins-s
Instance SegmentationMSCOCOAPL60.3RTMDet-Ins-s
Instance SegmentationMSCOCOAPM42.3RTMDet-Ins-s
Instance SegmentationMSCOCOAPS15.1RTMDet-Ins-s
2D ClassificationCOCO (Common Objects in Context)box AP52.8RTMDet
2D ClassificationHRSC2016mAP-0790.6RTMDet-R-tiny
2D ClassificationHRSC2016mAP-1297.1RTMDet-R-tiny
2D Object DetectionCOCO (Common Objects in Context)box AP52.8RTMDet
2D Object DetectionHRSC2016mAP-0790.6RTMDet-R-tiny
2D Object DetectionHRSC2016mAP-1297.1RTMDet-R-tiny
16kCOCO (Common Objects in Context)box AP52.8RTMDet
16kHRSC2016mAP-0790.6RTMDet-R-tiny
16kHRSC2016mAP-1297.1RTMDet-R-tiny

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17