TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FCOS3D: Fully Convolutional One-Stage Monocular 3D Object ...

FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection

Tai Wang, Xinge Zhu, Jiangmiao Pang, Dahua Lin

2021-04-22Monocular 3D Object DetectionAutonomous Drivingobject-detection3D Object DetectionObject Detection
PaperPDFCodeCodeCode(official)CodeCodeCodeCodeCodeCode

Abstract

Monocular 3D object detection is an important task for autonomous driving considering its advantage of low cost. It is much more challenging than conventional 2D cases due to its inherent ill-posed property, which is mainly reflected in the lack of depth information. Recent progress on 2D detection offers opportunities to better solving this problem. However, it is non-trivial to make a general adapted 2D detector work in this 3D task. In this paper, we study this problem with a practice built on a fully convolutional single-stage detector and propose a general framework FCOS3D. Specifically, we first transform the commonly defined 7-DoF 3D targets to the image domain and decouple them as 2D and 3D attributes. Then the objects are distributed to different feature levels with consideration of their 2D scales and assigned only according to the projected 3D-center for the training procedure. Furthermore, the center-ness is redefined with a 2D Gaussian distribution based on the 3D-center to fit the 3D target formulation. All of these make this framework simple yet effective, getting rid of any 2D detection or 2D-3D correspondence priors. Our solution achieves 1st place out of all the vision-only methods in the nuScenes 3D detection challenge of NeurIPS 2020. Code and models are released at https://github.com/open-mmlab/mmdetection3d.

Results

TaskDatasetMetricValueModel
Object DetectionnuScenesNDS0.428FCOS3D
Object DetectionnuScenesmAAE0.124FCOS3D
Object DetectionnuScenesmAOE0.452FCOS3D
Object DetectionnuScenesmAP0.358FCOS3D
Object DetectionnuScenesmASE0.249FCOS3D
Object DetectionnuScenesmATE0.69FCOS3D
Object DetectionnuScenesmAVE1.434FCOS3D
3DnuScenesNDS0.428FCOS3D
3DnuScenesmAAE0.124FCOS3D
3DnuScenesmAOE0.452FCOS3D
3DnuScenesmAP0.358FCOS3D
3DnuScenesmASE0.249FCOS3D
3DnuScenesmATE0.69FCOS3D
3DnuScenesmAVE1.434FCOS3D
3D Object DetectionnuScenesNDS0.428FCOS3D
3D Object DetectionnuScenesmAAE0.124FCOS3D
3D Object DetectionnuScenesmAOE0.452FCOS3D
3D Object DetectionnuScenesmAP0.358FCOS3D
3D Object DetectionnuScenesmASE0.249FCOS3D
3D Object DetectionnuScenesmATE0.69FCOS3D
3D Object DetectionnuScenesmAVE1.434FCOS3D
2D ClassificationnuScenesNDS0.428FCOS3D
2D ClassificationnuScenesmAAE0.124FCOS3D
2D ClassificationnuScenesmAOE0.452FCOS3D
2D ClassificationnuScenesmAP0.358FCOS3D
2D ClassificationnuScenesmASE0.249FCOS3D
2D ClassificationnuScenesmATE0.69FCOS3D
2D ClassificationnuScenesmAVE1.434FCOS3D
2D Object DetectionnuScenesNDS0.428FCOS3D
2D Object DetectionnuScenesmAAE0.124FCOS3D
2D Object DetectionnuScenesmAOE0.452FCOS3D
2D Object DetectionnuScenesmAP0.358FCOS3D
2D Object DetectionnuScenesmASE0.249FCOS3D
2D Object DetectionnuScenesmATE0.69FCOS3D
2D Object DetectionnuScenesmAVE1.434FCOS3D
16knuScenesNDS0.428FCOS3D
16knuScenesmAAE0.124FCOS3D
16knuScenesmAOE0.452FCOS3D
16knuScenesmAP0.358FCOS3D
16knuScenesmASE0.249FCOS3D
16knuScenesmATE0.69FCOS3D
16knuScenesmAVE1.434FCOS3D

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17