TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Cross-Modality Knowledge Distillation Network for Monocula...

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

Yu Hong, Hang Dai, Yong Ding

2022-11-14Monocular 3D Object DetectionKnowledge Distillationobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brought significant improvement, e.g., Pseudo-LiDAR methods. However, the existing methods usually apply non-end-to-end training strategies and insufficiently leverage the LiDAR information, where the rich potential of the LiDAR data has not been well exploited. In this paper, we propose the Cross-Modality Knowledge Distillation (CMKD) network for monocular 3D detection to efficiently and directly transfer the knowledge from LiDAR modality to image modality on both features and responses. Moreover, we further extend CMKD as a semi-supervised training framework by distilling knowledge from large-scale unlabeled data and significantly boost the performance. Until submission, CMKD ranks $1^{st}$ among the monocular 3D detectors with publications on both KITTI $test$ set and Waymo $val$ set with significant performance gains compared to previous state-of-the-art methods.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars EasyAP Easy28.55CMKD
Object DetectionKITTI Cyclist ModerateAP Medium6.67CMKD
Object DetectionKITTI Cyclist HardAP Hard6.34CMKD
Object DetectionKITTI Pedestrian EasyAP Easy13.94CMKD
Object DetectionKITTI Pedestrian HardAP Hard7.42CMKD
Object DetectionKITTI Pedestrian ModerateAP Medium8.79CMKD
Object DetectionKITTI Cars HardAP Hard16.77CMKD
Object DetectionKITTI Cyclist EasyAP Easy12.52CMKD
3DKITTI Cars EasyAP Easy28.55CMKD
3DKITTI Cyclist ModerateAP Medium6.67CMKD
3DKITTI Cyclist HardAP Hard6.34CMKD
3DKITTI Pedestrian EasyAP Easy13.94CMKD
3DKITTI Pedestrian HardAP Hard7.42CMKD
3DKITTI Pedestrian ModerateAP Medium8.79CMKD
3DKITTI Cars HardAP Hard16.77CMKD
3DKITTI Cyclist EasyAP Easy12.52CMKD
3D Object DetectionKITTI Cars EasyAP Easy28.55CMKD
3D Object DetectionKITTI Cyclist ModerateAP Medium6.67CMKD
3D Object DetectionKITTI Cyclist HardAP Hard6.34CMKD
3D Object DetectionKITTI Pedestrian EasyAP Easy13.94CMKD
3D Object DetectionKITTI Pedestrian HardAP Hard7.42CMKD
3D Object DetectionKITTI Pedestrian ModerateAP Medium8.79CMKD
3D Object DetectionKITTI Cars HardAP Hard16.77CMKD
3D Object DetectionKITTI Cyclist EasyAP Easy12.52CMKD
2D ClassificationKITTI Cars EasyAP Easy28.55CMKD
2D ClassificationKITTI Cyclist ModerateAP Medium6.67CMKD
2D ClassificationKITTI Cyclist HardAP Hard6.34CMKD
2D ClassificationKITTI Pedestrian EasyAP Easy13.94CMKD
2D ClassificationKITTI Pedestrian HardAP Hard7.42CMKD
2D ClassificationKITTI Pedestrian ModerateAP Medium8.79CMKD
2D ClassificationKITTI Cars HardAP Hard16.77CMKD
2D ClassificationKITTI Cyclist EasyAP Easy12.52CMKD
2D Object DetectionKITTI Cars EasyAP Easy28.55CMKD
2D Object DetectionKITTI Cyclist ModerateAP Medium6.67CMKD
2D Object DetectionKITTI Cyclist HardAP Hard6.34CMKD
2D Object DetectionKITTI Pedestrian EasyAP Easy13.94CMKD
2D Object DetectionKITTI Pedestrian HardAP Hard7.42CMKD
2D Object DetectionKITTI Pedestrian ModerateAP Medium8.79CMKD
2D Object DetectionKITTI Cars HardAP Hard16.77CMKD
2D Object DetectionKITTI Cyclist EasyAP Easy12.52CMKD
16kKITTI Cars EasyAP Easy28.55CMKD
16kKITTI Cyclist ModerateAP Medium6.67CMKD
16kKITTI Cyclist HardAP Hard6.34CMKD
16kKITTI Pedestrian EasyAP Easy13.94CMKD
16kKITTI Pedestrian HardAP Hard7.42CMKD
16kKITTI Pedestrian ModerateAP Medium8.79CMKD
16kKITTI Cars HardAP Hard16.77CMKD
16kKITTI Cyclist EasyAP Easy12.52CMKD

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16