TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segment...

OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation

Bohao Peng, Xiaoyang Wu, Li Jiang, Yukang Chen, Hengshuang Zhao, Zhuotao Tian, Jiaya Jia

2024-03-21CVPR 2024 1Semantic Segmentation3D Semantic SegmentationLIDAR Semantic Segmentation
PaperPDFCode(official)

Abstract

The booming of 3D recognition in the 2020s began with the introduction of point cloud transformers. They quickly overwhelmed sparse CNNs and became state-of-the-art models, especially in 3D semantic segmentation. However, sparse CNNs are still valuable networks, due to their efficiency treasure, and ease of application. In this work, we reexamine the design distinctions and test the limits of what a sparse CNN can achieve. We discover that the key credit to the performance difference is adaptivity. Specifically, we propose two key components, i.e., adaptive receptive fields (spatially) and adaptive relation, to bridge the gap. This exploration led to the creation of Omni-Adaptive 3D CNNs (OA-CNNs), a family of networks that integrates a lightweight module to greatly enhance the adaptivity of sparse CNNs at minimal computational cost. Without any self-attention modules, OA-CNNs favorably surpass point transformers in terms of accuracy in both indoor and outdoor scenes, with much less latency and memory cost. Notably, it achieves 76.1%, 78.9%, and 70.6% mIoU on ScanNet v2, nuScenes, and SemanticKITTI validation benchmarks respectively, while maintaining at most 5x better speed than transformer counterparts. This revelation highlights the potential of pure sparse CNNs to outperform transformer-related networks.

Results

TaskDatasetMetricValueModel
Semantic SegmentationScanNettest mIoU75.6OA-CNNs
Semantic SegmentationScanNetval mIoU76.1OA-CNNs
Semantic SegmentationScanNet200test mIoU32.3OA-CNNs
Semantic SegmentationScanNet200val mIoU33.3OA-CNNs
Semantic SegmentationScanNet++Top-1 IoU0.47OA-CNN
Semantic SegmentationScanNet++Top-3 IoU0.726OA-CNN
3D Semantic SegmentationScanNet200test mIoU32.3OA-CNNs
3D Semantic SegmentationScanNet200val mIoU33.3OA-CNNs
3D Semantic SegmentationScanNet++Top-1 IoU0.47OA-CNN
3D Semantic SegmentationScanNet++Top-3 IoU0.726OA-CNN
LIDAR Semantic SegmentationnuScenesval mIoU0.789OA-CNNs
10-shot image generationScanNettest mIoU75.6OA-CNNs
10-shot image generationScanNetval mIoU76.1OA-CNNs
10-shot image generationScanNet200test mIoU32.3OA-CNNs
10-shot image generationScanNet200val mIoU33.3OA-CNNs
10-shot image generationScanNet++Top-1 IoU0.47OA-CNN
10-shot image generationScanNet++Top-3 IoU0.726OA-CNN

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15