TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MobileStereoNet: Towards Lightweight Deep Networks for Ste...

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Faranak Shamsafar, Samuel Woerz, Rafia Rahim, Andreas Zell

2021-08-22Stereo MatchingStereo Depth EstimationDisparity EstimationDepth Estimation
PaperPDFCodeCodeCodeCode(official)

Abstract

Recent methods in stereo matching have continuously improved the accuracy using deep models. This gain, however, is attained with a high increase in computation cost, such that the network may not fit even on a moderate GPU. This issue raises problems when the model needs to be deployed on resource-limited devices. For this, we propose two light models for stereo vision with reduced complexity and without sacrificing accuracy. Depending on the dimension of cost volume, we design a 2D and a 3D model with encoder-decoders built from 2D and 3D convolutions, respectively. To this end, we leverage 2D MobileNet blocks and extend them to 3D for stereo vision application. Besides, a new cost volume is proposed to boost the accuracy of the 2D model, making it performing close to 3D networks. Experiments show that the proposed 2D/3D networks effectively reduce the computational expense (27%/95% and 72%/38% fewer parameters/operations in 2D and 3D models, respectively) while upholding the accuracy. Our code is available at https://github.com/cogsys-tuebingen/mobilestereonet.

Results

TaskDatasetMetricValueModel
Depth EstimationsceneflowAverage End-Point Error0.83D-MobileStereoNet
Depth EstimationsceneflowEPE0.83D-MobileStereoNet
Depth EstimationsceneflowAverage End-Point Error1.142D-MobileStereoNet
Depth EstimationsceneflowEPE1.142D-MobileStereoNet
Depth EstimationKITTI2015three pixel error1.693D-MobileStereoNet
Depth EstimationKITTI2015three pixel error2.672D-MobileStereoNet
3DsceneflowAverage End-Point Error0.83D-MobileStereoNet
3DsceneflowEPE0.83D-MobileStereoNet
3DsceneflowAverage End-Point Error1.142D-MobileStereoNet
3DsceneflowEPE1.142D-MobileStereoNet
3DKITTI2015three pixel error1.693D-MobileStereoNet
3DKITTI2015three pixel error2.672D-MobileStereoNet
Stereo Depth EstimationsceneflowAverage End-Point Error0.83D-MobileStereoNet
Stereo Depth EstimationsceneflowEPE0.83D-MobileStereoNet
Stereo Depth EstimationsceneflowAverage End-Point Error1.142D-MobileStereoNet
Stereo Depth EstimationsceneflowEPE1.142D-MobileStereoNet
Stereo Depth EstimationKITTI2015three pixel error1.693D-MobileStereoNet
Stereo Depth EstimationKITTI2015three pixel error2.672D-MobileStereoNet

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Cameras as Relative Positional Encoding2025-07-14ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way2025-07-11