LION: Linear Group RNN for 3D Object Detection in Point Clouds

Zhe Liu, Jinghua Hou, Xinyu Wang, Xiaoqing Ye, Jingdong Wang, Hengshuang Zhao, Xiang Bai

2024-07-25Long-range modeling object-detection 3D Object Detection Object Detection

Abstract

The benefit of transformers in large-scale 3D point cloud perception tasks, such as 3D object detection, is limited by their quadratic computation cost when modeling long-range relationships. In contrast, linear RNNs have low computational complexity and are suitable for long-range modeling. Toward this goal, we propose a simple and effective window-based framework built on LInear grOup RNN (i.e., perform linear RNN for grouped features) for accurate 3D object detection, called LION. The key property is to allow sufficient feature interaction in a much larger group than transformer-based methods. However, effectively applying linear group RNN to 3D object detection in highly sparse point clouds is not trivial due to its limitation in handling spatial modeling. To tackle this problem, we simply introduce a 3D spatial feature descriptor and integrate it into the linear group RNN operators to enhance their spatial features rather than blindly increasing the number of scanning orders for voxel features. To further address the challenge in highly sparse point clouds, we propose a 3D voxel generation strategy to densify foreground features thanks to linear group RNN as a natural property of auto-regressive models. Extensive experiments verify the effectiveness of the proposed components and the generalization of our LION on different linear group RNN operators including Mamba, RWKV, and RetNet. Furthermore, it is worth mentioning that our LION-Mamba achieves state-of-the-art on Waymo, nuScenes, Argoverse V2, and ONCE dataset. Last but not least, our method supports kinds of advanced linear RNN operators (e.g., RetNet, RWKV, Mamba, xLSTM and TTT) on small but popular KITTI dataset for a quick experience with our linear RNN-based framework.

Results

Task	Dataset	Metric	Value	Model
Object Detection	nuScenes LiDAR only	NDS	73.9	LION
Object Detection	nuScenes LiDAR only	NDS (val)	72.1	LION
Object Detection	nuScenes LiDAR only	mAP	69.8	LION
Object Detection	nuScenes LiDAR only	mAP (val)	68	LION
Object Detection	ONCE	mAP	66.6	LION
Object Detection	Argoverse2	mAP	41.5	LION
Object Detection	Waymo Open Dataset	mAPH/L2	74	LION
3D	nuScenes LiDAR only	NDS	73.9	LION
3D	nuScenes LiDAR only	NDS (val)	72.1	LION
3D	nuScenes LiDAR only	mAP	69.8	LION
3D	nuScenes LiDAR only	mAP (val)	68	LION
3D	ONCE	mAP	66.6	LION
3D	Argoverse2	mAP	41.5	LION
3D	Waymo Open Dataset	mAPH/L2	74	LION
3D Object Detection	nuScenes LiDAR only	NDS	73.9	LION
3D Object Detection	nuScenes LiDAR only	NDS (val)	72.1	LION
3D Object Detection	nuScenes LiDAR only	mAP	69.8	LION
3D Object Detection	nuScenes LiDAR only	mAP (val)	68	LION
3D Object Detection	ONCE	mAP	66.6	LION
3D Object Detection	Argoverse2	mAP	41.5	LION
3D Object Detection	Waymo Open Dataset	mAPH/L2	74	LION
2D Classification	nuScenes LiDAR only	NDS	73.9	LION
2D Classification	nuScenes LiDAR only	NDS (val)	72.1	LION
2D Classification	nuScenes LiDAR only	mAP	69.8	LION
2D Classification	nuScenes LiDAR only	mAP (val)	68	LION
2D Classification	ONCE	mAP	66.6	LION
2D Classification	Argoverse2	mAP	41.5	LION
2D Classification	Waymo Open Dataset	mAPH/L2	74	LION
2D Object Detection	nuScenes LiDAR only	NDS	73.9	LION
2D Object Detection	nuScenes LiDAR only	NDS (val)	72.1	LION
2D Object Detection	nuScenes LiDAR only	mAP	69.8	LION
2D Object Detection	nuScenes LiDAR only	mAP (val)	68	LION
2D Object Detection	ONCE	mAP	66.6	LION
2D Object Detection	Argoverse2	mAP	41.5	LION
2D Object Detection	Waymo Open Dataset	mAPH/L2	74	LION
16k	nuScenes LiDAR only	NDS	73.9	LION
16k	nuScenes LiDAR only	NDS (val)	72.1	LION
16k	nuScenes LiDAR only	mAP	69.8	LION
16k	nuScenes LiDAR only	mAP (val)	68	LION
16k	ONCE	mAP	66.6	LION
16k	Argoverse2	mAP	41.5	LION
16k	Waymo Open Dataset	mAPH/L2	74	LION

LION: Linear Group RNN for 3D Object Detection in Point Clouds

Abstract

Results

Related Papers

LION: Linear Group RNN for 3D Object Detection in Point Clouds

Abstract

Results

Related Papers