TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/M3DeTR: Multi-representation, Multi-scale, Mutual-relation...

M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers

Tianrui Guan, Jun Wang, Shiyi Lan, Rohan Chandra, Zuxuan Wu, Larry Davis, Dinesh Manocha

2021-04-24object-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

We present a novel architecture for 3D object detection, M3DeTR, which combines different point cloud representations (raw, voxels, bird-eye view) with different feature scales based on multi-scale feature pyramids. M3DeTR is the first approach that unifies multiple point cloud representations, feature scales, as well as models mutual relationships between point clouds simultaneously using transformers. We perform extensive ablation experiments that highlight the benefits of fusing representation and scale, and modeling the relationships. Our method achieves state-of-the-art performance on the KITTI 3D object detection dataset and Waymo Open Dataset. Results show that M3DeTR improves the baseline significantly by 1.48% mAP for all classes on Waymo Open Dataset. In particular, our approach ranks 1st on the well-known KITTI 3D Detection Benchmark for both car and cyclist classes, and ranks 1st on Waymo Open Dataset with single frame point cloud input. Our code is available at: https://github.com/rayguan97/M3DETR.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars Hard valAP82.85M3DeTR
Object DetectionKITTI Cyclist Moderate valAP71.7M3DeTR
Object DetectionKITTI Cyclist Easy valAP89.13M3DeTR
Object DetectionKITTI Cyclist Hard valAP68.29M3DeTR
Object DetectionKITTI Pedestrian Moderate valAP60.63M3DeTR
Object DetectionKITTI Pedestrian Hard valAP56.49M3DeTR
Object DetectionKITTI Cars Moderate valAP85.41M3DeTR
Object DetectionKITTI Pedestrian Easy valAP67.64M3DeTR
Object DetectionKITTI Cars Easy valAP92.29M3DeTR
Object Detectionwaymo cyclistAPH/L267.28M3DeTR
Object Detectionwaymo vehicleAP77.09M3DeTR
Object Detectionwaymo vehicleAPH/L270.54M3DeTR
Object Detectionwaymo vehicleL1 mAP77.66M3DeTR
Object Detectionwaymo pedestrianAPH/L268.2M3DeTR
3DKITTI Cars Hard valAP82.85M3DeTR
3DKITTI Cyclist Moderate valAP71.7M3DeTR
3DKITTI Cyclist Easy valAP89.13M3DeTR
3DKITTI Cyclist Hard valAP68.29M3DeTR
3DKITTI Pedestrian Moderate valAP60.63M3DeTR
3DKITTI Pedestrian Hard valAP56.49M3DeTR
3DKITTI Cars Moderate valAP85.41M3DeTR
3DKITTI Pedestrian Easy valAP67.64M3DeTR
3DKITTI Cars Easy valAP92.29M3DeTR
3Dwaymo cyclistAPH/L267.28M3DeTR
3Dwaymo vehicleAP77.09M3DeTR
3Dwaymo vehicleAPH/L270.54M3DeTR
3Dwaymo vehicleL1 mAP77.66M3DeTR
3Dwaymo pedestrianAPH/L268.2M3DeTR
3D Object DetectionKITTI Cars Hard valAP82.85M3DeTR
3D Object DetectionKITTI Cyclist Moderate valAP71.7M3DeTR
3D Object DetectionKITTI Cyclist Easy valAP89.13M3DeTR
3D Object DetectionKITTI Cyclist Hard valAP68.29M3DeTR
3D Object DetectionKITTI Pedestrian Moderate valAP60.63M3DeTR
3D Object DetectionKITTI Pedestrian Hard valAP56.49M3DeTR
3D Object DetectionKITTI Cars Moderate valAP85.41M3DeTR
3D Object DetectionKITTI Pedestrian Easy valAP67.64M3DeTR
3D Object DetectionKITTI Cars Easy valAP92.29M3DeTR
3D Object Detectionwaymo cyclistAPH/L267.28M3DeTR
3D Object Detectionwaymo vehicleAP77.09M3DeTR
3D Object Detectionwaymo vehicleAPH/L270.54M3DeTR
3D Object Detectionwaymo vehicleL1 mAP77.66M3DeTR
3D Object Detectionwaymo pedestrianAPH/L268.2M3DeTR
2D ClassificationKITTI Cars Hard valAP82.85M3DeTR
2D ClassificationKITTI Cyclist Moderate valAP71.7M3DeTR
2D ClassificationKITTI Cyclist Easy valAP89.13M3DeTR
2D ClassificationKITTI Cyclist Hard valAP68.29M3DeTR
2D ClassificationKITTI Pedestrian Moderate valAP60.63M3DeTR
2D ClassificationKITTI Pedestrian Hard valAP56.49M3DeTR
2D ClassificationKITTI Cars Moderate valAP85.41M3DeTR
2D ClassificationKITTI Pedestrian Easy valAP67.64M3DeTR
2D ClassificationKITTI Cars Easy valAP92.29M3DeTR
2D Classificationwaymo cyclistAPH/L267.28M3DeTR
2D Classificationwaymo vehicleAP77.09M3DeTR
2D Classificationwaymo vehicleAPH/L270.54M3DeTR
2D Classificationwaymo vehicleL1 mAP77.66M3DeTR
2D Classificationwaymo pedestrianAPH/L268.2M3DeTR
2D Object DetectionKITTI Cars Hard valAP82.85M3DeTR
2D Object DetectionKITTI Cyclist Moderate valAP71.7M3DeTR
2D Object DetectionKITTI Cyclist Easy valAP89.13M3DeTR
2D Object DetectionKITTI Cyclist Hard valAP68.29M3DeTR
2D Object DetectionKITTI Pedestrian Moderate valAP60.63M3DeTR
2D Object DetectionKITTI Pedestrian Hard valAP56.49M3DeTR
2D Object DetectionKITTI Cars Moderate valAP85.41M3DeTR
2D Object DetectionKITTI Pedestrian Easy valAP67.64M3DeTR
2D Object DetectionKITTI Cars Easy valAP92.29M3DeTR
2D Object Detectionwaymo cyclistAPH/L267.28M3DeTR
2D Object Detectionwaymo vehicleAP77.09M3DeTR
2D Object Detectionwaymo vehicleAPH/L270.54M3DeTR
2D Object Detectionwaymo vehicleL1 mAP77.66M3DeTR
2D Object Detectionwaymo pedestrianAPH/L268.2M3DeTR
16kKITTI Cars Hard valAP82.85M3DeTR
16kKITTI Cyclist Moderate valAP71.7M3DeTR
16kKITTI Cyclist Easy valAP89.13M3DeTR
16kKITTI Cyclist Hard valAP68.29M3DeTR
16kKITTI Pedestrian Moderate valAP60.63M3DeTR
16kKITTI Pedestrian Hard valAP56.49M3DeTR
16kKITTI Cars Moderate valAP85.41M3DeTR
16kKITTI Pedestrian Easy valAP67.64M3DeTR
16kKITTI Cars Easy valAP92.29M3DeTR
16kwaymo cyclistAPH/L267.28M3DeTR
16kwaymo vehicleAP77.09M3DeTR
16kwaymo vehicleAPH/L270.54M3DeTR
16kwaymo vehicleL1 mAP77.66M3DeTR
16kwaymo pedestrianAPH/L268.2M3DeTR

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07