Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels

Lue Fan, Feng Wang, Naiyan Wang, Zhaoxiang Zhang

2023-08-07
Semantic Segmentation, Clustering, Instance Segmentation, 3D Multi-Object Tracking, object-detection, 3D Object Detection, Object Detection

Abstract

LiDAR-based fully sparse architecture has garnered increasing attention. FSDv1 stands out as a representative work, achieving impressive efficacy and efficiency, albeit with intricate structures and handcrafted designs. In this paper, we present FSDv2, an evolution that aims to simplify the previous FSDv1 while eliminating the inductive bias introduced by its handcrafted instance-level representation, thus promoting better general applicability. To this end, we introduce the concept of virtual voxels, which takes over the clustering-based instance segmentation in FSDv1. Virtual voxels not only address the notorious issue of the Center Feature Missing problem in fully sparse detectors but also endow the framework with a more elegant and streamlined approach. Consequently, we develop a suite of components to complement the virtual voxel concept, including a virtual voxel encoder, a virtual voxel mixer, and a virtual voxel assignment strategy. Through empirical validation, we demonstrate that the virtual voxel mechanism is functionally similar to the handcrafted clustering in FSDv1 while being more general. We conduct experiments on three large-scale datasets: Waymo Open Dataset, Argoverse 2 dataset, and nuScenes dataset. Our results showcase state-of-the-art performance on all three datasets, highlighting the superiority of FSDv2 in long-range scenarios and its general applicability to achieve competitive performance across diverse scenarios. Moreover, we provide comprehensive experimental analysis to elucidate the workings of FSDv2. To foster reproducibility and further research, we have open-sourced FSDv2 at https://github.com/tusen-ai/SST.
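
The abstract does not spell out how virtual voxels are built, so the following is only a rough sketch under stated assumptions, not the authors' implementation. It assumes that each foreground point is shifted by a predicted offset toward its object center and that the shifted points are then voxelized, so that occupied voxels appear near object centers even where no LiDAR point fell. The function name, the arguments `fg_mask` and `offsets`, and the max-pooling of features are all hypothetical choices made for illustration.

```python
import numpy as np


def virtual_voxels(points, fg_mask, offsets, feats, voxel_size):
    """Group shifted foreground points into voxels (illustrative only).

    points:     (N, 3) xyz coordinates
    fg_mask:    (N,) bool, hypothetical foreground prediction
    offsets:    (N, 3) hypothetical predicted offsets toward object centers
    feats:      (N, C) per-point features
    voxel_size: edge length of a cubic voxel
    Returns integer voxel coordinates (V, 3) and max-pooled features (V, C).
    """
    # Assumption: foreground points "vote" for a center by adding an offset.
    voted = points[fg_mask] + offsets[fg_mask]
    coords = np.floor(voted / voxel_size).astype(np.int64)

    # One virtual voxel per distinct occupied cell.
    uniq, inverse = np.unique(coords, axis=0, return_inverse=True)
    inverse = inverse.reshape(-1)  # flatten for consistency across NumPy versions

    # Max-pool the features of all voted points that land in the same cell.
    pooled = np.full((len(uniq), feats.shape[1]), -np.inf)
    np.maximum.at(pooled, inverse, feats[fg_mask])
    return uniq, pooled
```

In this toy form, two points on opposite sides of an object that both vote for the same center end up in one voxel, which illustrates how such a scheme could place features at an otherwise empty center. The encoder, mixer, and assignment strategy named in the abstract are not modeled here.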

Results

Task                      Dataset                                       Metric   Value   Model
Multi-Object Tracking     Waymo Open Dataset: Vehicle (Online Methods)  FP/L2    0.0745  CTRL_FSD_TTA
Multi-Object Tracking     Waymo Open Dataset: Vehicle (Online Methods)  MOTA/L1  0.7735  CTRL_FSD_TTA
Multi-Object Tracking     Waymo Open Dataset: Vehicle (Online Methods)  MOTA/L2  0.7429  CTRL_FSD_TTA
Object Tracking           Waymo Open Dataset: Vehicle (Online Methods)  FP/L2    0.0745  CTRL_FSD_TTA
Object Tracking           Waymo Open Dataset: Vehicle (Online Methods)  MOTA/L1  0.7735  CTRL_FSD_TTA
Object Tracking           Waymo Open Dataset: Vehicle (Online Methods)  MOTA/L2  0.7429  CTRL_FSD_TTA
3D Multi-Object Tracking  Waymo Open Dataset: Vehicle (Online Methods)  FP/L2    0.0745  CTRL_FSD_TTA
3D Multi-Object Tracking  Waymo Open Dataset: Vehicle (Online Methods)  MOTA/L1  0.7735  CTRL_FSD_TTA
3D Multi-Object Tracking  Waymo Open Dataset: Vehicle (Online Methods)  MOTA/L2  0.7429  CTRL_FSD_TTA
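
For readers unfamiliar with the MOTA metric listed above, here is a minimal sketch of the standard CLEAR MOT definition: one minus the sum of misses, false positives, and identity switches, divided by the number of ground-truth objects. The function name and arguments are illustrative, and the exact matching rules and the L1/L2 difficulty split used by the Waymo benchmark are not reproduced here. Reading the FP/L2 row as a false-positive count normalized by ground-truth objects is an assumption.

```python
def mota(false_negatives, false_positives, id_switches, num_gt):
    """Multi-Object Tracking Accuracy from error counts summed over all frames.

    All arguments are hypothetical aggregate counts; num_gt is the total
    number of ground-truth objects across frames.
    """
    if num_gt <= 0:
        raise ValueError("num_gt must be positive")
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / num_gt
```

With made-up counts of 10 misses, 5 false positives, and 5 identity switches over 100 ground-truth objects, the error ratio is 20/100, so the score is 0.8. MOTA can be negative when the errors outnumber the ground-truth objects.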

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Tri-Learn Graph Fusion Network for Attributed Graph Clustering (2025-07-18)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)