TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panop...

Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking

Whye Kit Fong, Rohit Mohan, Juana Valeria Hurtado, Lubing Zhou, Holger Caesar, Oscar Beijbom, Abhinav Valada

2021-09-08BenchmarkingPanoptic SegmentationNavigateScene UnderstandingSemantic Segmentation
PaperPDFCode

Abstract

Panoptic scene understanding and tracking of dynamic agents are essential for robots and automated vehicles to navigate in urban environments. As LiDARs provide accurate illumination-independent geometric depictions of the scene, performing these tasks using LiDAR point clouds provides reliable predictions. However, existing datasets lack diversity in the type of urban scenes and have a limited number of dynamic object instances which hinders both learning of these tasks as well as credible benchmarking of the developed methods. In this paper, we introduce the large-scale Panoptic nuScenes benchmark dataset that extends our popular nuScenes dataset with point-wise groundtruth annotations for semantic segmentation, panoptic segmentation, and panoptic tracking tasks. To facilitate comparison, we provide several strong baselines for each of these tasks on our proposed dataset. Moreover, we analyze the drawbacks of the existing metrics for panoptic tracking and propose the novel instance-centric PAT metric that addresses the concerns. We present exhaustive experiments that demonstrate the utility of Panoptic nuScenes compared to existing datasets and make the online evaluation server available at nuScenes.org. We believe that this extension will accelerate the research of novel methods for scene understanding of dynamic urban environments.

Results

TaskDatasetMetricValueModel
Semantic SegmentationPanoptic nuScenes testPQ76.8(AF)2-S3Net + CenterPoint
Semantic SegmentationPanoptic nuScenes testRQ85.4(AF)2-S3Net + CenterPoint
Semantic SegmentationPanoptic nuScenes testSQ89.5(AF)2-S3Net + CenterPoint
Semantic SegmentationPanoptic nuScenes testmIoU78.8(AF)2-S3Net + CenterPoint
Semantic SegmentationPanoptic nuScenes valPQ63.4PolarSeg-Panoptic
Semantic SegmentationPanoptic nuScenes valRQ75.3PolarSeg-Panoptic
Semantic SegmentationPanoptic nuScenes valSQ83.9PolarSeg-Panoptic
Semantic SegmentationPanoptic nuScenes valmIoU66.9PolarSeg-Panoptic
10-shot image generationPanoptic nuScenes testPQ76.8(AF)2-S3Net + CenterPoint
10-shot image generationPanoptic nuScenes testRQ85.4(AF)2-S3Net + CenterPoint
10-shot image generationPanoptic nuScenes testSQ89.5(AF)2-S3Net + CenterPoint
10-shot image generationPanoptic nuScenes testmIoU78.8(AF)2-S3Net + CenterPoint
10-shot image generationPanoptic nuScenes valPQ63.4PolarSeg-Panoptic
10-shot image generationPanoptic nuScenes valRQ75.3PolarSeg-Panoptic
10-shot image generationPanoptic nuScenes valSQ83.9PolarSeg-Panoptic
10-shot image generationPanoptic nuScenes valmIoU66.9PolarSeg-Panoptic
Panoptic SegmentationPanoptic nuScenes testPQ76.8(AF)2-S3Net + CenterPoint
Panoptic SegmentationPanoptic nuScenes testRQ85.4(AF)2-S3Net + CenterPoint
Panoptic SegmentationPanoptic nuScenes testSQ89.5(AF)2-S3Net + CenterPoint
Panoptic SegmentationPanoptic nuScenes testmIoU78.8(AF)2-S3Net + CenterPoint
Panoptic SegmentationPanoptic nuScenes valPQ63.4PolarSeg-Panoptic
Panoptic SegmentationPanoptic nuScenes valRQ75.3PolarSeg-Panoptic
Panoptic SegmentationPanoptic nuScenes valSQ83.9PolarSeg-Panoptic
Panoptic SegmentationPanoptic nuScenes valmIoU66.9PolarSeg-Panoptic

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Visual Place Recognition for Large-Scale UAV Applications2025-07-20Training Transformers with Enforced Lipschitz Constants2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17