MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Faranak Shamsafar, Samuel Woerz, Rafia Rahim, Andreas Zell

2021-08-22Stereo Matching Stereo Depth Estimation Disparity Estimation Depth Estimation

Abstract

Recent methods in stereo matching have continuously improved the accuracy using deep models. This gain, however, is attained with a high increase in computation cost, such that the network may not fit even on a moderate GPU. This issue raises problems when the model needs to be deployed on resource-limited devices. For this, we propose two light models for stereo vision with reduced complexity and without sacrificing accuracy. Depending on the dimension of cost volume, we design a 2D and a 3D model with encoder-decoders built from 2D and 3D convolutions, respectively. To this end, we leverage 2D MobileNet blocks and extend them to 3D for stereo vision application. Besides, a new cost volume is proposed to boost the accuracy of the 2D model, making it performing close to 3D networks. Experiments show that the proposed 2D/3D networks effectively reduce the computational expense (27%/95% and 72%/38% fewer parameters/operations in 2D and 3D models, respectively) while upholding the accuracy. Our code is available at https://github.com/cogsys-tuebingen/mobilestereonet.

Results

Task	Dataset	Metric	Value	Model
Depth Estimation	sceneflow	Average End-Point Error	0.8	3D-MobileStereoNet
Depth Estimation	sceneflow	EPE	0.8	3D-MobileStereoNet
Depth Estimation	sceneflow	Average End-Point Error	1.14	2D-MobileStereoNet
Depth Estimation	sceneflow	EPE	1.14	2D-MobileStereoNet
Depth Estimation	KITTI2015	three pixel error	1.69	3D-MobileStereoNet
Depth Estimation	KITTI2015	three pixel error	2.67	2D-MobileStereoNet
3D	sceneflow	Average End-Point Error	0.8	3D-MobileStereoNet
3D	sceneflow	EPE	0.8	3D-MobileStereoNet
3D	sceneflow	Average End-Point Error	1.14	2D-MobileStereoNet
3D	sceneflow	EPE	1.14	2D-MobileStereoNet
3D	KITTI2015	three pixel error	1.69	3D-MobileStereoNet
3D	KITTI2015	three pixel error	2.67	2D-MobileStereoNet
Stereo Depth Estimation	sceneflow	Average End-Point Error	0.8	3D-MobileStereoNet
Stereo Depth Estimation	sceneflow	EPE	0.8	3D-MobileStereoNet
Stereo Depth Estimation	sceneflow	Average End-Point Error	1.14	2D-MobileStereoNet
Stereo Depth Estimation	sceneflow	EPE	1.14	2D-MobileStereoNet
Stereo Depth Estimation	KITTI2015	three pixel error	1.69	3D-MobileStereoNet
Stereo Depth Estimation	KITTI2015	three pixel error	2.67	2D-MobileStereoNet

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Abstract

Results

Related Papers

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Abstract

Results

Related Papers