Benjin Zhu, Zhengkai Jiang, Xiangxin Zhou, Zeming Li, Gang Yu
This report presents our method which wins the nuScenes3D Detection Challenge [17] held in Workshop on Autonomous Driving(WAD, CVPR 2019). Generally, we utilize sparse 3D convolution to extract rich semantic features, which are then fed into a class-balanced multi-head network to perform 3D object detection. To handle the severe class imbalance problem inherent in the autonomous driving scenarios, we design a class-balanced sampling and augmentation strategy to generate a more balanced data distribution. Furthermore, we propose a balanced group-ing head to boost the performance for the categories withsimilar shapes. Based on the Challenge results, our methodoutperforms the PointPillars [14] baseline by a large mar-gin across all metrics, achieving state-of-the-art detection performance on the nuScenes dataset. Code will be released at CBGS.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | nuScenes LiDAR only | NDS | 63.3 | CBGS |
| Object Detection | nuScenes LiDAR only | NDS (val) | 62.3 | CBGS |
| Object Detection | nuScenes LiDAR only | mAP | 52.8 | CBGS |
| Object Detection | nuScenes LiDAR only | mAP (val) | 50.6 | CBGS |
| Object Detection | nuScenes | mAP | 0.528 | MEGVII |
| 3D | nuScenes LiDAR only | NDS | 63.3 | CBGS |
| 3D | nuScenes LiDAR only | NDS (val) | 62.3 | CBGS |
| 3D | nuScenes LiDAR only | mAP | 52.8 | CBGS |
| 3D | nuScenes LiDAR only | mAP (val) | 50.6 | CBGS |
| 3D | nuScenes | mAP | 0.528 | MEGVII |
| 3D Object Detection | nuScenes LiDAR only | NDS | 63.3 | CBGS |
| 3D Object Detection | nuScenes LiDAR only | NDS (val) | 62.3 | CBGS |
| 3D Object Detection | nuScenes LiDAR only | mAP | 52.8 | CBGS |
| 3D Object Detection | nuScenes LiDAR only | mAP (val) | 50.6 | CBGS |
| 3D Object Detection | nuScenes | mAP | 0.528 | MEGVII |
| 2D Classification | nuScenes LiDAR only | NDS | 63.3 | CBGS |
| 2D Classification | nuScenes LiDAR only | NDS (val) | 62.3 | CBGS |
| 2D Classification | nuScenes LiDAR only | mAP | 52.8 | CBGS |
| 2D Classification | nuScenes LiDAR only | mAP (val) | 50.6 | CBGS |
| 2D Classification | nuScenes | mAP | 0.528 | MEGVII |
| 2D Object Detection | nuScenes LiDAR only | NDS | 63.3 | CBGS |
| 2D Object Detection | nuScenes LiDAR only | NDS (val) | 62.3 | CBGS |
| 2D Object Detection | nuScenes LiDAR only | mAP | 52.8 | CBGS |
| 2D Object Detection | nuScenes LiDAR only | mAP (val) | 50.6 | CBGS |
| 2D Object Detection | nuScenes | mAP | 0.528 | MEGVII |
| 16k | nuScenes LiDAR only | NDS | 63.3 | CBGS |
| 16k | nuScenes LiDAR only | NDS (val) | 62.3 | CBGS |
| 16k | nuScenes LiDAR only | mAP | 52.8 | CBGS |
| 16k | nuScenes LiDAR only | mAP (val) | 50.6 | CBGS |
| 16k | nuScenes | mAP | 0.528 | MEGVII |