TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Vote3Deep: Fast Object Detection in 3D Point Clouds Using ...

Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks

Martin Engelcke, Dushyant Rao, Dominic Zeng Wang, Chi Hay Tong, Ingmar Posner

2016-09-21Real-Time Object Detectionobject-detection3D Object DetectionObject Detection
PaperPDF

Abstract

This paper proposes a computationally efficient approach to detecting objects natively in 3D point clouds using convolutional neural networks (CNNs). In particular, this is achieved by leveraging a feature-centric voting scheme to implement novel convolutional layers which explicitly exploit the sparsity encountered in the input. To this end, we examine the trade-off between accuracy and speed for different architectures and additionally propose to use an L1 penalty on the filter activations to further encourage sparsity in the intermediate representations. To the best of our knowledge, this is the first work to propose sparse convolutional layers and L1 regularisation for efficient large-scale processing of 3D data. We demonstrate the efficacy of our approach on the KITTI object detection benchmark and show that Vote3Deep models with as few as three layers outperform the previous state of the art in both laser and laser-vision based approaches by margins of up to 40% while remaining highly competitive in terms of processing time.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars ModerateAP68.24Vote3Deep
Object DetectionKITTI Cyclists ModerateAP67.88Vote3Deep
Object DetectionKITTI Pedestrians ModerateAP55.37Vote3Deep
Object DetectionKITTI Cyclists HardAP62.98Vote3Deep
Object DetectionKITTI Cars HardAP63.23Vote3Deep
Object DetectionKITTI Cyclists EasyAP79.92Vote3Deep
Object DetectionKITTI Pedestrians EasyAP68.39Vote3Deep
Object DetectionKITTI Cars EasyAP76.79Vote3Deep
Object DetectionKITTI Pedestrians HardAP52.59Vote3Deep
3DKITTI Cars ModerateAP68.24Vote3Deep
3DKITTI Cyclists ModerateAP67.88Vote3Deep
3DKITTI Pedestrians ModerateAP55.37Vote3Deep
3DKITTI Cyclists HardAP62.98Vote3Deep
3DKITTI Cars HardAP63.23Vote3Deep
3DKITTI Cyclists EasyAP79.92Vote3Deep
3DKITTI Pedestrians EasyAP68.39Vote3Deep
3DKITTI Cars EasyAP76.79Vote3Deep
3DKITTI Pedestrians HardAP52.59Vote3Deep
2D ClassificationKITTI Cars ModerateAP68.24Vote3Deep
2D ClassificationKITTI Cyclists ModerateAP67.88Vote3Deep
2D ClassificationKITTI Pedestrians ModerateAP55.37Vote3Deep
2D ClassificationKITTI Cyclists HardAP62.98Vote3Deep
2D ClassificationKITTI Cars HardAP63.23Vote3Deep
2D ClassificationKITTI Cyclists EasyAP79.92Vote3Deep
2D ClassificationKITTI Pedestrians EasyAP68.39Vote3Deep
2D ClassificationKITTI Cars EasyAP76.79Vote3Deep
2D ClassificationKITTI Pedestrians HardAP52.59Vote3Deep
2D Object DetectionKITTI Cars ModerateAP68.24Vote3Deep
2D Object DetectionKITTI Cyclists ModerateAP67.88Vote3Deep
2D Object DetectionKITTI Pedestrians ModerateAP55.37Vote3Deep
2D Object DetectionKITTI Cyclists HardAP62.98Vote3Deep
2D Object DetectionKITTI Cars HardAP63.23Vote3Deep
2D Object DetectionKITTI Cyclists EasyAP79.92Vote3Deep
2D Object DetectionKITTI Pedestrians EasyAP68.39Vote3Deep
2D Object DetectionKITTI Cars EasyAP76.79Vote3Deep
2D Object DetectionKITTI Pedestrians HardAP52.59Vote3Deep
16kKITTI Cars ModerateAP68.24Vote3Deep
16kKITTI Cyclists ModerateAP67.88Vote3Deep
16kKITTI Pedestrians ModerateAP55.37Vote3Deep
16kKITTI Cyclists HardAP62.98Vote3Deep
16kKITTI Cars HardAP63.23Vote3Deep
16kKITTI Cyclists EasyAP79.92Vote3Deep
16kKITTI Pedestrians EasyAP68.39Vote3Deep
16kKITTI Cars EasyAP76.79Vote3Deep
16kKITTI Pedestrians HardAP52.59Vote3Deep

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07