TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SMOKE: Single-Stage Monocular 3D Object Detection via Keyp...

SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation

Zechen Liu, Zizhang Wu, Roland Tóth

2020-02-24Monocular 3D Object DetectionTranslationAutonomous Navigationobject-detection3D Object DetectionObject DetectionKeypoint Estimation
PaperPDFCodeCode(official)Code

Abstract

Estimating 3D orientation and translation of objects is essential for infrastructure-less autonomous navigation and driving. In case of monocular vision, successful methods have been mainly based on two ingredients: (i) a network generating 2D region proposals, (ii) a R-CNN structure predicting 3D object pose by utilizing the acquired regions of interest. We argue that the 2D detection network is redundant and introduces non-negligible noise for 3D detection. Hence, we propose a novel 3D object detection method, named SMOKE, in this paper that predicts a 3D bounding box for each detected object by combining a single keypoint estimate with regressed 3D variables. As a second contribution, we propose a multi-step disentangling approach for constructing the 3D bounding box, which significantly improves both training convergence and detection accuracy. In contrast to previous 3D detection techniques, our method does not require complicated pre/post-processing, extra data, and a refinement stage. Despite of its structural simplicity, our proposed SMOKE network outperforms all existing monocular 3D detection methods on the KITTI dataset, giving the best state-of-the-art result on both 3D object detection and Bird's eye view evaluation. The code will be made publicly available.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars ModerateAP Medium9.76SMOKE
3DKITTI Cars ModerateAP Medium9.76SMOKE
3D Object DetectionKITTI Cars ModerateAP Medium9.76SMOKE
2D ClassificationKITTI Cars ModerateAP Medium9.76SMOKE
2D Object DetectionKITTI Cars ModerateAP Medium9.76SMOKE
16kKITTI Cars ModerateAP Medium9.76SMOKE

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Function-to-Style Guidance of LLMs for Code Translation2025-07-15