TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/UniDet3D: Multi-dataset Indoor 3D Object Detection

UniDet3D: Multi-dataset Indoor 3D Object Detection

Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin, Danila Rukhovich, Anton Konushin

2024-09-06object-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Growing customer demand for smart solutions in robotics and augmented reality has attracted considerable attention to 3D object detection from point clouds. Yet, existing indoor datasets taken individually are too small and insufficiently diverse to train a powerful and general 3D object detection model. In the meantime, more general approaches utilizing foundation models are still inferior in quality to those based on supervised training for a specific task. In this work, we propose \ours{}, a simple yet effective 3D object detection model, which is trained on a mixture of indoor datasets and is capable of working in various indoor environments. By unifying different label spaces, \ours{} enables learning a strong representation across multiple datasets through a supervised joint training scheme. The proposed network architecture is built upon a vanilla transformer encoder, making it easy to run, customize and extend the prediction pipeline for practical use. Extensive experiments demonstrate that \ours{} obtains significant gains over existing 3D object detection methods in 6 indoor benchmarks: ScanNet (+1.1 mAP50), ARKitScenes (+19.4 mAP25), S3DIS (+9.1 mAP50), MultiScan (+9.3 mAP50), 3RScan (+3.2 mAP50), and ScanNet++ (+2.7 mAP50). Code is available at https://github.com/filapro/unidet3d .

Results

TaskDatasetMetricValueModel
Object DetectionARKitScenesmAP@0.2561.3UniDet3D
Object DetectionARKitScenesmAP@0.547.1UniDet3D
Object Detection3RScanmAP@0.2564.7UniDet3D
Object Detection3RScanmAP@0.548.6UniDet3D
Object DetectionScanNet++mAP@0.2526.4UniDet3D
Object DetectionScanNet++mAP@0.517.2UniDet3D
Object DetectionS3DISmAP@0.2575.2UniDet3D
Object DetectionS3DISmAP@0.560.8UniDet3D
Object DetectionMultiScanmAP@0.2564.2UniDet3D
Object DetectionMultiScanmAP@0.551.6UniDet3D
Object DetectionScanNetV2mAP@0.2577.5UniDet3D
Object DetectionScanNetV2mAP@0.566.1UniDet3D
3DARKitScenesmAP@0.2561.3UniDet3D
3DARKitScenesmAP@0.547.1UniDet3D
3D3RScanmAP@0.2564.7UniDet3D
3D3RScanmAP@0.548.6UniDet3D
3DScanNet++mAP@0.2526.4UniDet3D
3DScanNet++mAP@0.517.2UniDet3D
3DS3DISmAP@0.2575.2UniDet3D
3DS3DISmAP@0.560.8UniDet3D
3DMultiScanmAP@0.2564.2UniDet3D
3DMultiScanmAP@0.551.6UniDet3D
3DScanNetV2mAP@0.2577.5UniDet3D
3DScanNetV2mAP@0.566.1UniDet3D
3D Object DetectionARKitScenesmAP@0.2561.3UniDet3D
3D Object DetectionARKitScenesmAP@0.547.1UniDet3D
3D Object Detection3RScanmAP@0.2564.7UniDet3D
3D Object Detection3RScanmAP@0.548.6UniDet3D
3D Object DetectionScanNet++mAP@0.2526.4UniDet3D
3D Object DetectionScanNet++mAP@0.517.2UniDet3D
3D Object DetectionS3DISmAP@0.2575.2UniDet3D
3D Object DetectionS3DISmAP@0.560.8UniDet3D
3D Object DetectionMultiScanmAP@0.2564.2UniDet3D
3D Object DetectionMultiScanmAP@0.551.6UniDet3D
3D Object DetectionScanNetV2mAP@0.2577.5UniDet3D
3D Object DetectionScanNetV2mAP@0.566.1UniDet3D
2D ClassificationARKitScenesmAP@0.2561.3UniDet3D
2D ClassificationARKitScenesmAP@0.547.1UniDet3D
2D Classification3RScanmAP@0.2564.7UniDet3D
2D Classification3RScanmAP@0.548.6UniDet3D
2D ClassificationScanNet++mAP@0.2526.4UniDet3D
2D ClassificationScanNet++mAP@0.517.2UniDet3D
2D ClassificationS3DISmAP@0.2575.2UniDet3D
2D ClassificationS3DISmAP@0.560.8UniDet3D
2D ClassificationMultiScanmAP@0.2564.2UniDet3D
2D ClassificationMultiScanmAP@0.551.6UniDet3D
2D ClassificationScanNetV2mAP@0.2577.5UniDet3D
2D ClassificationScanNetV2mAP@0.566.1UniDet3D
2D Object DetectionARKitScenesmAP@0.2561.3UniDet3D
2D Object DetectionARKitScenesmAP@0.547.1UniDet3D
2D Object Detection3RScanmAP@0.2564.7UniDet3D
2D Object Detection3RScanmAP@0.548.6UniDet3D
2D Object DetectionScanNet++mAP@0.2526.4UniDet3D
2D Object DetectionScanNet++mAP@0.517.2UniDet3D
2D Object DetectionS3DISmAP@0.2575.2UniDet3D
2D Object DetectionS3DISmAP@0.560.8UniDet3D
2D Object DetectionMultiScanmAP@0.2564.2UniDet3D
2D Object DetectionMultiScanmAP@0.551.6UniDet3D
2D Object DetectionScanNetV2mAP@0.2577.5UniDet3D
2D Object DetectionScanNetV2mAP@0.566.1UniDet3D
16kARKitScenesmAP@0.2561.3UniDet3D
16kARKitScenesmAP@0.547.1UniDet3D
16k3RScanmAP@0.2564.7UniDet3D
16k3RScanmAP@0.548.6UniDet3D
16kScanNet++mAP@0.2526.4UniDet3D
16kScanNet++mAP@0.517.2UniDet3D
16kS3DISmAP@0.2575.2UniDet3D
16kS3DISmAP@0.560.8UniDet3D
16kMultiScanmAP@0.2564.2UniDet3D
16kMultiScanmAP@0.551.6UniDet3D
16kScanNetV2mAP@0.2577.5UniDet3D
16kScanNetV2mAP@0.566.1UniDet3D

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07