Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication

Runsheng Xu, Hao Xiang, Xin Xia, Xu Han, Jinlong Li, Jiaqi Ma

2021-09-16 · Benchmarking · 3D Object Detection
Paper · PDF · Code (official)

Abstract

Employing Vehicle-to-Vehicle communication to enhance perception performance in self-driving technology has attracted considerable attention recently; however, the absence of a suitable open dataset for benchmarking algorithms has made it difficult to develop and assess cooperative perception technologies. To this end, we present the first large-scale open simulated dataset for Vehicle-to-Vehicle perception. It contains over 70 interesting scenes, 11,464 frames, and 232,913 annotated 3D vehicle bounding boxes, collected from 8 towns in CARLA and a digital town of Culver City, Los Angeles. We then construct a comprehensive benchmark with a total of 16 implemented models to evaluate several information fusion strategies (i.e., early, late, and intermediate fusion) with state-of-the-art LiDAR detection algorithms. Moreover, we propose a new Attentive Intermediate Fusion pipeline to aggregate information from multiple connected vehicles. Our experiments show that the proposed pipeline can be easily integrated with existing 3D LiDAR detectors and achieves outstanding performance even at large compression rates. To encourage more researchers to investigate Vehicle-to-Vehicle perception, we release the dataset, benchmark methods, and all related code at https://mobility-lab.seas.ucla.edu/opv2v/.
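The Attentive Intermediate Fusion pipeline aggregates intermediate feature maps shared by connected vehicles, weighting each vehicle's contribution at every spatial location. A minimal numpy sketch of that idea, assuming per-vehicle BEV feature maps of shape [C, H, W]; the element-wise softmax attention here is an illustrative stand-in, not the paper's exact learned module:

```python
import numpy as np

def attentive_fusion(features):
    """Fuse per-vehicle feature maps with a per-location softmax over vehicles.

    features: list of N arrays, each [C, H, W], one per connected vehicle.
    Returns a single [C, H, W] fused map where, at each channel/location,
    vehicles with stronger responses receive higher attention weight.
    """
    stacked = np.stack(features, axis=0)                 # [N, C, H, W]
    # Numerically stable softmax across the vehicle axis.
    weights = np.exp(stacked - stacked.max(axis=0, keepdims=True))
    weights /= weights.sum(axis=0, keepdims=True)
    return (weights * stacked).sum(axis=0)               # [C, H, W]
```

When all vehicles share identical features, the fusion reduces to the identity; where one vehicle observes a strong response (e.g. an occluded object visible only to it), that response dominates the fused map.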

Results

Task | Dataset | Metric | Value | Model
--- | --- | --- | --- | ---
3D Object Detection | OPV2V | AP@0.7 (Default) | 0.815 | Attentive Fusion (PointPillar backbone)
3D Object Detection | OPV2V | AP@0.7 (Culver City) | 0.735 | Attentive Fusion (PointPillar backbone)
3D Object Detection | OPV2V | AP@0.7 (Default) | 0.781 | Late Fusion (PointPillar backbone)
3D Object Detection | OPV2V | AP@0.7 (Culver City) | 0.669 | Late Fusion (PointPillar backbone)
3D Object Detection | V2XSet | AP@0.5 (Perfect) | 0.807 | AttentiveFusion
3D Object Detection | V2XSet | AP@0.5 (Noisy) | 0.709 | AttentiveFusion
3D Object Detection | V2XSet | AP@0.7 (Perfect) | 0.664 | AttentiveFusion
3D Object Detection | V2XSet | AP@0.7 (Noisy) | 0.487 | AttentiveFusion
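The AP@0.7 metrics above count a predicted box as a true positive only when its overlap (IoU) with a ground-truth box is at least 0.7. The benchmark evaluates rotated 3D boxes; a minimal sketch with axis-aligned bird's-eye-view boxes shows the core overlap computation:

```python
def bev_iou(a, b):
    """Intersection-over-union of two axis-aligned BEV boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])  # intersection lower-left
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])  # intersection upper-right
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)
```

For example, two 2×2 boxes offset by one unit overlap in a 1×2 strip, giving IoU = 2 / 6 ≈ 0.33, well below the 0.7 threshold used here.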

Related Papers

Visual Place Recognition for Large-Scale UAV Applications (2025-07-20)
Training Transformers with Enforced Lipschitz Constants (2025-07-17)
Disentangling coincident cell events using deep transfer learning and compressive sensing (2025-07-17)
MUPAX: Multidimensional Problem Agnostic eXplainable AI (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition (2025-07-16)
DCR: Quantifying Data Contamination in LLMs Evaluation (2025-07-15)
A Multi-View High-Resolution Foot-Ankle Complex Point Cloud Dataset During Gait for Occlusion-Robust 3D Completion (2025-07-15)