TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object...

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu

2022-07-213D Object Detection From Monocular ImagesMonocular 3D Object Detection3D Object DetectionObject Detection
PaperPDFCode(official)Code

Abstract

Modern neural networks use building blocks such as convolutions that are equivariant to arbitrary 2D translations. However, these vanilla blocks are not equivariant to arbitrary 3D translations in the projective manifold. Even then, all monocular 3D detectors use vanilla blocks to obtain the 3D coordinates, a task for which the vanilla blocks are not designed for. This paper takes the first step towards convolutions equivariant to arbitrary 3D translations in the projective manifold. Since the depth is the hardest to estimate for monocular detection, this paper proposes Depth EquiVarIAnt NeTwork (DEVIANT) built with existing scale equivariant steerable blocks. As a result, DEVIANT is equivariant to the depth translations in the projective manifold whereas vanilla networks are not. The additional depth equivariance forces the DEVIANT to learn consistent depth estimates, and therefore, DEVIANT achieves state-of-the-art monocular 3D detection results on KITTI and Waymo datasets in the image-only category and performs competitively to methods using extra information. Moreover, DEVIANT works better than vanilla networks in cross-dataset evaluation. Code and models at https://github.com/abhi1kumar/DEVIANT

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars ModerateAP Medium14.46DEVIANT
Object DetectionWaymo Open Dataset3D mAPH Vehicle (Front Camera Only)2.52DEVIANT
Object DetectionKITTI-360AP2526.96DEVIANT
Object DetectionKITTI-360AP500.88DEVIANT
3DKITTI Cars ModerateAP Medium14.46DEVIANT
3DWaymo Open Dataset3D mAPH Vehicle (Front Camera Only)2.52DEVIANT
3DKITTI-360AP2526.96DEVIANT
3DKITTI-360AP500.88DEVIANT
3D Object DetectionKITTI Cars ModerateAP Medium14.46DEVIANT
2D ClassificationKITTI Cars ModerateAP Medium14.46DEVIANT
2D ClassificationWaymo Open Dataset3D mAPH Vehicle (Front Camera Only)2.52DEVIANT
2D ClassificationKITTI-360AP2526.96DEVIANT
2D ClassificationKITTI-360AP500.88DEVIANT
2D Object DetectionKITTI Cars ModerateAP Medium14.46DEVIANT
2D Object DetectionWaymo Open Dataset3D mAPH Vehicle (Front Camera Only)2.52DEVIANT
2D Object DetectionKITTI-360AP2526.96DEVIANT
2D Object DetectionKITTI-360AP500.88DEVIANT
16kKITTI Cars ModerateAP Medium14.46DEVIANT
16kWaymo Open Dataset3D mAPH Vehicle (Front Camera Only)2.52DEVIANT
16kKITTI-360AP2526.96DEVIANT
16kKITTI-360AP500.88DEVIANT

Related Papers

Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07