TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Spherical Transformer for LiDAR-based 3D Recognition

Spherical Transformer for LiDAR-based 3D Recognition

Xin Lai, Yukang Chen, Fanbin Lu, Jianhui Liu, Jiaya Jia

2023-03-22CVPR 2023 1feature selectionSemantic Segmentation3D Semantic Segmentationobject-detection3D Object DetectionObject DetectionLIDAR Semantic Segmentation
PaperPDFCodeCode(official)

Abstract

LiDAR-based 3D point cloud recognition has benefited various applications. Without specially considering the LiDAR point distribution, most current methods suffer from information disconnection and limited receptive field, especially for the sparse distant points. In this work, we study the varying-sparsity distribution of LiDAR points and present SphereFormer to directly aggregate information from dense close points to the sparse distant ones. We design radial window self-attention that partitions the space into multiple non-overlapping narrow and long windows. It overcomes the disconnection issue and enlarges the receptive field smoothly and dramatically, which significantly boosts the performance of sparse distant points. Moreover, to fit the narrow and long windows, we propose exponential splitting to yield fine-grained position encoding and dynamic feature selection to increase model representation ability. Notably, our method ranks 1st on both nuScenes and SemanticKITTI semantic segmentation benchmarks with 81.9% and 74.8% mIoU, respectively. Also, we achieve the 3rd place on nuScenes object detection benchmark with 72.8% NDS and 68.5% mAP. Code is available at https://github.com/dvlab-research/SphereFormer.git.

Results

TaskDatasetMetricValueModel
Semantic SegmentationKITTI Semantic SegmentationMean IoU (class)80.7RPVNet [xu2021rpvnet]
Semantic SegmentationWildScenesmIoU33.97SphereFormer
3D Semantic SegmentationWildScenesmIoU33.97SphereFormer
LIDAR Semantic SegmentationnuScenestest mIoU0.819SphereFormer
LIDAR Semantic SegmentationnuScenesval mIoU0.795SphereFormer
10-shot image generationKITTI Semantic SegmentationMean IoU (class)80.7RPVNet [xu2021rpvnet]
10-shot image generationWildScenesmIoU33.97SphereFormer

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21mNARX+: A surrogate model for complex dynamical systems using manifold-NARX and automatic feature selection2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17