TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Confidence Propagation through CNNs for Guided Sparse Dept...

Confidence Propagation through CNNs for Guided Sparse Depth Regression

Abdelrahman Eldesokey, Michael Felsberg, Fahad Shahbaz Khan

2018-11-05Depth CompletionregressionAutonomous Driving
PaperPDFCode(official)

Abstract

Generally, convolutional neural networks (CNNs) process data on a regular grid, e.g. data generated by ordinary cameras. Designing CNNs for sparse and irregularly spaced input data is still an open research problem with numerous applications in autonomous driving, robotics, and surveillance. In this paper, we propose an algebraically-constrained normalized convolution layer for CNNs with highly sparse input that has a smaller number of network parameters compared to related work. We propose novel strategies for determining the confidence from the convolution operation and propagating it to consecutive layers. We also propose an objective function that simultaneously minimizes the data error while maximizing the output confidence. To integrate structural information, we also investigate fusion strategies to combine depth and RGB information in our normalized convolution network framework. In addition, we introduce the use of output confidence as an auxiliary information to improve the results. The capabilities of our normalized convolution network framework are demonstrated for the problem of scene depth completion. Comprehensive experiments are performed on the KITTI-Depth and the NYU-Depth-v2 datasets. The results clearly demonstrate that the proposed approach achieves superior performance while requiring only about 1-5% of the number of parameters compared to the state-of-the-art methods.

Results

TaskDatasetMetricValueModel
Depth CompletionKITTI Depth CompletionMAE233NConv-CNN-L2
Depth CompletionKITTI Depth CompletionRMSE830NConv-CNN-L2
Depth CompletionKITTI Depth CompletionRuntime [ms]20NConv-CNN-L2
Depth CompletionKITTI Depth CompletionMAE208NConv-CNN-L1
Depth CompletionKITTI Depth CompletionRMSE859NConv-CNN-L1
Depth CompletionKITTI Depth CompletionRuntime [ms]20NConv-CNN-L1
Depth CompletionKITTI Depth CompletionMAE360NConv-CNN
Depth CompletionKITTI Depth CompletionRMSE1268NConv-CNN
Depth CompletionKITTI Depth CompletionRuntime [ms]10NConv-CNN

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16