
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


nuScenes: A multimodal dataset for autonomous driving

Holger Caesar, Varun Bankiti, Alex H. Lang, Sourabh Vora, Venice Erin Liong, Qiang Xu, Anush Krishnan, Yu Pan, Giancarlo Baldan, Oscar Beijbom

Date: 2019-03-26
Venue: CVPR 2020 6
Tasks: Autonomous Vehicles, Autonomous Driving, object-detection, 3D Object Detection, Object Detection

Abstract

Robust detection and tracking of objects is crucial for the deployment of autonomous vehicle technology. Image based benchmark datasets have driven development in computer vision tasks such as object detection, tracking and segmentation of agents in the environment. Most autonomous vehicles, however, carry a combination of cameras and range sensors such as lidar and radar. As machine learning based methods for detection and tracking become more prevalent, there is a need to train and evaluate such methods on datasets containing range sensor data along with images. In this work we present nuTonomy scenes (nuScenes), the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view. nuScenes comprises 1000 scenes, each 20s long and fully annotated with 3D bounding boxes for 23 classes and 8 attributes. It has 7x as many annotations and 100x as many images as the pioneering KITTI dataset. We define novel 3D detection and tracking metrics. We also provide careful dataset analysis as well as baselines for lidar and image based detection and tracking. Data, development kit and more information are available online.

Results

Task                 Dataset   Metric  Value  Model
Object Detection     nuScenes  NDS     0.449  PointPillars (ImageNet)
Object Detection     nuScenes  NDS     0.448  PointPillars (KITTI)
Object Detection     nuScenes  NDS     0.442  PointPillars
3D                   nuScenes  NDS     0.449  PointPillars (ImageNet)
3D                   nuScenes  NDS     0.448  PointPillars (KITTI)
3D                   nuScenes  NDS     0.442  PointPillars
3D Object Detection  nuScenes  NDS     0.449  PointPillars (ImageNet)
3D Object Detection  nuScenes  NDS     0.448  PointPillars (KITTI)
3D Object Detection  nuScenes  NDS     0.442  PointPillars
2D Classification    nuScenes  NDS     0.449  PointPillars (ImageNet)
2D Classification    nuScenes  NDS     0.448  PointPillars (KITTI)
2D Classification    nuScenes  NDS     0.442  PointPillars
2D Object Detection  nuScenes  NDS     0.449  PointPillars (ImageNet)
2D Object Detection  nuScenes  NDS     0.448  PointPillars (KITTI)
2D Object Detection  nuScenes  NDS     0.442  PointPillars
16k                  nuScenes  NDS     0.449  PointPillars (ImageNet)
16k                  nuScenes  NDS     0.448  PointPillars (KITTI)
16k                  nuScenes  NDS     0.442  PointPillars
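
The NDS values reported in the results are the nuScenes detection score, one of the detection metrics the abstract refers to. This page does not define it. The sketch below follows the definition as it is commonly stated for nuScenes: a weighted combination of mAP and five mean true-positive errors (translation, scale, orientation, velocity, attribute), each clipped to 1 and converted to a score. The function name `nds`, the dictionary layout, and the input numbers are assumptions made for illustration. They are not results from the paper and this is not the official devkit implementation.

```python
def nds(mean_ap, tp_errors):
    """Combine mAP and five mean true-positive errors into one score.

    Sketch of: NDS = (5 * mAP + sum(1 - min(1, err))) / 10
    """
    if len(tp_errors) != 5:
        raise ValueError("expected five true-positive error values")
    # Clip each error to at most 1, then convert it to a score in [0, 1].
    tp_scores = [1.0 - min(1.0, err) for err in tp_errors.values()]
    # mAP carries weight 5; each of the five error scores carries weight 1.
    return (5.0 * mean_ap + sum(tp_scores)) / 10.0


# Hypothetical inputs for illustration only (not taken from the paper).
example_errors = {
    "mATE": 0.5,   # translation error
    "mASE": 0.25,  # scale error
    "mAOE": 0.5,   # orientation error
    "mAVE": 1.5,   # velocity error, clipped to 1 inside nds()
    "mAAE": 0.25,  # attribute error
}
score = nds(0.5, example_errors)
print(score)  # 0.5
```

Because half of the weight sits on mAP and half on the error terms, two models with the same mAP can still differ in NDS when one estimates box pose, size, or velocity more accurately.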

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving (2025-07-19)
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework (2025-07-18)
World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving (2025-07-17)
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models (2025-07-17)
Channel-wise Motion Features for Efficient Motion Segmentation (2025-07-17)
LaViPlan : Language-Guided Visual Path Planning with RLVR (2025-07-17)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)