TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Where2comm: Communication-Efficient Collaborative Percepti...

Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps

Yue Hu, Shaoheng Fang, Zixing Lei, Yiqi Zhong, Siheng Chen

2022-09-26Monocular 3D Object Detectionobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Multi-agent collaborative perception could significantly upgrade the perception performance by enabling agents to share complementary information with each other through communication. It inevitably results in a fundamental trade-off between perception performance and communication bandwidth. To tackle this bottleneck issue, we propose a spatial confidence map, which reflects the spatial heterogeneity of perceptual information. It empowers agents to only share spatially sparse, yet perceptually critical information, contributing to where to communicate. Based on this novel spatial confidence map, we propose Where2comm, a communication-efficient collaborative perception framework. Where2comm has two distinct advantages: i) it considers pragmatic compression and uses less communication to achieve higher perception performance by focusing on perceptually critical areas; and ii) it can handle varying communication bandwidth by dynamically adjusting spatial areas involved in communication. To evaluate Where2comm, we consider 3D object detection in both real-world and simulation scenarios with two modalities (camera/LiDAR) and two agent types (cars/drones) on four datasets: OPV2V, V2X-Sim, DAIR-V2X, and our original CoPerception-UAVs. Where2comm consistently outperforms previous methods; for example, it achieves more than $100,000 \times$ lower communication volume and still outperforms DiscoNet and V2X-ViT on OPV2V. Our code is available at https://github.com/MediaBrain-SJTU/where2comm.

Results

TaskDatasetMetricValueModel
Object DetectionDAIR-V2XAP5063.71Where2comm
Object DetectionV2X-SIMmAOE0.31Where2comm
Object DetectionV2X-SIMmAP19Where2comm
Object DetectionV2X-SIMmASE0.275Where2comm
Object DetectionV2X-SIMmATE0.911Where2comm
Object DetectionCoPerception-UAVsAP5065.71Where2comm
Object DetectionOPV2VAP5047.14Where2comm
3DDAIR-V2XAP5063.71Where2comm
3DV2X-SIMmAOE0.31Where2comm
3DV2X-SIMmAP19Where2comm
3DV2X-SIMmASE0.275Where2comm
3DV2X-SIMmATE0.911Where2comm
3DCoPerception-UAVsAP5065.71Where2comm
3DOPV2VAP5047.14Where2comm
3D Object DetectionDAIR-V2XAP5063.71Where2comm
3D Object DetectionV2X-SIMmAOE0.31Where2comm
3D Object DetectionV2X-SIMmAP19Where2comm
3D Object DetectionV2X-SIMmASE0.275Where2comm
3D Object DetectionV2X-SIMmATE0.911Where2comm
3D Object DetectionCoPerception-UAVsAP5065.71Where2comm
3D Object DetectionOPV2VAP5047.14Where2comm
2D ClassificationDAIR-V2XAP5063.71Where2comm
2D ClassificationV2X-SIMmAOE0.31Where2comm
2D ClassificationV2X-SIMmAP19Where2comm
2D ClassificationV2X-SIMmASE0.275Where2comm
2D ClassificationV2X-SIMmATE0.911Where2comm
2D ClassificationCoPerception-UAVsAP5065.71Where2comm
2D ClassificationOPV2VAP5047.14Where2comm
2D Object DetectionDAIR-V2XAP5063.71Where2comm
2D Object DetectionV2X-SIMmAOE0.31Where2comm
2D Object DetectionV2X-SIMmAP19Where2comm
2D Object DetectionV2X-SIMmASE0.275Where2comm
2D Object DetectionV2X-SIMmATE0.911Where2comm
2D Object DetectionCoPerception-UAVsAP5065.71Where2comm
2D Object DetectionOPV2VAP5047.14Where2comm
16kDAIR-V2XAP5063.71Where2comm
16kV2X-SIMmAOE0.31Where2comm
16kV2X-SIMmAP19Where2comm
16kV2X-SIMmASE0.275Where2comm
16kV2X-SIMmATE0.911Where2comm
16kCoPerception-UAVsAP5065.71Where2comm
16kOPV2VAP5047.14Where2comm

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07