Visibility-aware Multi-view Stereo Network

Jingyang Zhang, Yao Yao, Shiwei Li, Zixin Luo, Tian Fang

2020-08-18
Point Clouds, 3D Reconstruction, Depth Estimation

Abstract

Learning-based multi-view stereo (MVS) methods have demonstrated promising results. However, very few existing networks explicitly take pixel-wise visibility into account, resulting in erroneous cost aggregation from occluded pixels. In this paper, we explicitly infer and integrate pixel-wise occlusion information in the MVS network via matching uncertainty estimation. The pair-wise uncertainty map is inferred jointly with the pair-wise depth map and is then used as weighting guidance during multi-view cost volume fusion. As such, the adverse influence of occluded pixels is suppressed in the cost fusion. The proposed framework, Vis-MVSNet, significantly improves depth accuracy in scenes with severe occlusion. Extensive experiments are performed on the DTU, BlendedMVS, and Tanks and Temples datasets to validate the effectiveness of the proposed framework.
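The uncertainty-guided fusion described in the abstract can be illustrated with a minimal sketch. It assumes each source view contributes a pair-wise cost volume and a per-pixel uncertainty value, and that a view's weight at a pixel is exp(-uncertainty). The function name `fuse_cost_volumes`, the nested-list layout, and the exact weighting function are assumptions made for illustration, not details stated in the abstract.

```python
import math


def fuse_cost_volumes(pair_volumes, pair_uncertainties):
    """Fuse pair-wise cost volumes with per-pixel uncertainty weights.

    pair_volumes[i][d][p]    : cost of view pair i at depth index d, pixel p
    pair_uncertainties[i][p] : uncertainty of view pair i at pixel p

    Assumption (illustrative only): weight = exp(-uncertainty), so pixels
    with high uncertainty (likely occluded) contribute little to the result.
    """
    num_depths = len(pair_volumes[0])
    num_pixels = len(pair_volumes[0][0])

    # One weight per view pair and pixel, shared across all depth hypotheses.
    weights = [[math.exp(-s) for s in unc] for unc in pair_uncertainties]

    fused = []
    for d in range(num_depths):
        row = []
        for p in range(num_pixels):
            total_weight = sum(w[p] for w in weights)
            weighted_cost = sum(
                w[p] * vol[d][p] for w, vol in zip(weights, pair_volumes)
            )
            # Normalized weighted average over the view pairs.
            row.append(weighted_cost / total_weight)
        fused.append(row)
    return fused


# Two view pairs, two depth hypotheses, one pixel.
volume_a = [[1.0], [3.0]]
volume_b = [[5.0], [7.0]]

# Equal uncertainty: the fused volume is the plain average.
fused_equal = fuse_cost_volumes([volume_a, volume_b], [[0.0], [0.0]])

# View pair b is highly uncertain: the fused volume follows view pair a.
fused_occluded = fuse_cost_volumes([volume_a, volume_b], [[0.0], [50.0]])
```

With equal uncertainties the fusion reduces to a plain mean over view pairs; raising one pair's uncertainty shifts the fused cost toward the remaining pairs, which is the suppression effect the abstract describes.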

Results

Task	Dataset	Metric	Value	Model
3D Reconstruction	DTU	Acc	0.369	Vis-MVSNet
3D Reconstruction	DTU	Comp	0.361	Vis-MVSNet
3D Reconstruction	DTU	Overall	0.365	Vis-MVSNet
3D	DTU	Acc	0.369	Vis-MVSNet
3D	DTU	Comp	0.361	Vis-MVSNet
3D	DTU	Overall	0.365	Vis-MVSNet
Point Clouds	DTU	Overall	0.365	Vis-MVSNet
Point Clouds	Tanks and Temples	Mean F1 (Intermediate)	60.03	Vis-MVSNet
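
On DTU, the Overall score is conventionally reported as the mean of Acc and Comp. The short check below, using only the values from the table, shows that the listed numbers are consistent with that convention; the helper name `overall_score` is hypothetical.

```python
def overall_score(acc, comp):
    """Mean of accuracy and completeness (assumed DTU 'Overall' convention)."""
    return (acc + comp) / 2


# Values taken from the results table above.
dtu_acc = 0.369
dtu_comp = 0.361
dtu_overall_reported = 0.365

dtu_overall_computed = round(overall_score(dtu_acc, dtu_comp), 3)
is_consistent = abs(dtu_overall_computed - dtu_overall_reported) < 1e-9
```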

Related Papers

AutoPartGen: Autoregressive 3D Part Generation and Discovery (2025-07-17)
$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation (2025-07-17)
$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
SpatialTrackerV2: 3D Point Tracking Made Easy (2025-07-16)
BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images (2025-07-16)
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation (2025-07-16)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation (2025-07-15)