TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Less is More: Reducing Task and Model Complexity for 3D Po...

Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation

Li Li, Hubert P. H. Shum, Toby P. Breckon

2023-03-20CVPR 2023 1Semi-Supervised Semantic SegmentationSegmentationAutonomous DrivingSemantic Segmentation3D Semantic Segmentation
PaperPDFCode(official)

Abstract

Whilst the availability of 3D LiDAR point cloud data has significantly grown in recent years, annotation remains expensive and time-consuming, leading to a demand for semi-supervised semantic segmentation methods with application domains such as autonomous driving. Existing work very often employs relatively large segmentation backbone networks to improve segmentation accuracy, at the expense of computational costs. In addition, many use uniform sampling to reduce ground truth data requirements for learning needed, often resulting in sub-optimal performance. To address these issues, we propose a new pipeline that employs a smaller architecture, requiring fewer ground-truth annotations to achieve superior segmentation accuracy compared to contemporary approaches. This is facilitated via a novel Sparse Depthwise Separable Convolution module that significantly reduces the network parameter count while retaining overall task performance. To effectively sub-sample our training data, we propose a new Spatio-Temporal Redundant Frame Downsampling (ST-RFD) method that leverages knowledge of sensor motion within the environment to extract a more diverse subset of training data frame samples. To leverage the use of limited annotated data samples, we further propose a soft pseudo-label method informed by LiDAR reflectivity. Our method outperforms contemporary semi-supervised work in terms of mIoU, using less labeled data, on the SemanticKITTI (59.5@5%) and ScribbleKITTI (58.1@5%) benchmark datasets, based on a 2.3x reduction in model parameters and 641x fewer multiply-add operations whilst also demonstrating significant performance improvement on limited training data (i.e., Less is More).

Results

TaskDatasetMetricValueModel
Semantic SegmentationSemanticKITTImIoU-1%58.4LiM3D
Semantic SegmentationSemanticKITTImIoU-1%57.2LiM3D+SDSC
Semantic SegmentationScribbleKITTImIoU-1%57LiM3D
Semantic SegmentationScribbleKITTImIoU-1%55.8LiM3D+SDSC
3D Semantic SegmentationSemanticKITTImIoU-1%58.4LiM3D
3D Semantic SegmentationSemanticKITTImIoU-1%57.2LiM3D+SDSC
3D Semantic SegmentationScribbleKITTImIoU-1%57LiM3D
3D Semantic SegmentationScribbleKITTImIoU-1%55.8LiM3D+SDSC
10-shot image generationSemanticKITTImIoU-1%58.4LiM3D
10-shot image generationSemanticKITTImIoU-1%57.2LiM3D+SDSC
10-shot image generationScribbleKITTImIoU-1%57LiM3D
10-shot image generationScribbleKITTImIoU-1%55.8LiM3D+SDSC

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17