TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multimodal Object Detection via Probabilistic Ensembling

Multimodal Object Detection via Probabilistic Ensembling

Yi-Ting Chen, Jinghao Shi, Zelin Ye, Christoph Mertz, Deva Ramanan, Shu Kong

2021-04-07Autonomous VehiclesPedestrian Detectionobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)CodeCode

Abstract

Object detection with multimodal inputs can improve many safety-critical systems such as autonomous vehicles (AVs). Motivated by AVs that operate in both day and night, we study multimodal object detection with RGB and thermal cameras, since the latter provides much stronger object signatures under poor illumination. We explore strategies for fusing information from different modalities. Our key contribution is a probabilistic ensembling technique, ProbEn, a simple non-learned method that fuses together detections from multi-modalities. We derive ProbEn from Bayes' rule and first principles that assume conditional independence across modalities. Through probabilistic marginalization, ProbEn elegantly handles missing modalities when detectors do not fire on the same object. Importantly, ProbEn also notably improves multimodal detection even when the conditional independence assumption does not hold, e.g., fusing outputs from other fusion methods (both off-the-shelf and trained in-house). We validate ProbEn on two benchmarks containing both aligned (KAIST) and unaligned (FLIR) multimodal images, showing that ProbEn outperforms prior work by more than 13% in relative performance!

Results

TaskDatasetMetricValueModel
Autonomous VehiclesLLVIPAP0.515ProbEn
Autonomous VehiclesLLVIPAP0.515ProbEn
Object DetectionInOutDoor AP62.4ProbEN
Object DetectionEventPedAP60.1ProEN
Object DetectionSTCrowdAP60ProbEN
3DInOutDoor AP62.4ProbEN
3DEventPedAP60.1ProEN
3DSTCrowdAP60ProbEN
2D ClassificationInOutDoor AP62.4ProbEN
2D ClassificationEventPedAP60.1ProEN
2D ClassificationSTCrowdAP60ProbEN
Pedestrian DetectionLLVIPAP0.515ProbEn
Pedestrian DetectionLLVIPAP0.515ProbEn
2D Object DetectionInOutDoor AP62.4ProbEN
2D Object DetectionEventPedAP60.1ProEN
2D Object DetectionSTCrowdAP60ProbEN
16kInOutDoor AP62.4ProbEN
16kEventPedAP60.1ProEN
16kSTCrowdAP60ProbEN

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15Fast and Accurate Collision Probability Estimation for Autonomous Vehicles using Adaptive Sigma-Point Sampling2025-07-08ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08