TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Visibility-aware Multi-view Stereo Network

Visibility-aware Multi-view Stereo Network

Jingyang Zhang, Yao Yao, Shiwei Li, Zixin Luo, Tian Fang

2020-08-18Point Clouds3D ReconstructionDepth Estimation
PaperPDFCode(official)

Abstract

Learning-based multi-view stereo (MVS) methods have demonstrated promising results. However, very few existing networks explicitly take the pixel-wise visibility into consideration, resulting in erroneous cost aggregation from occluded pixels. In this paper, we explicitly infer and integrate the pixel-wise occlusion information in the MVS network via the matching uncertainty estimation. The pair-wise uncertainty map is jointly inferred with the pair-wise depth map, which is further used as weighting guidance during the multi-view cost volume fusion. As such, the adverse influence of occluded pixels is suppressed in the cost fusion. The proposed framework Vis-MVSNet significantly improves depth accuracies in the scenes with severe occlusion. Extensive experiments are performed on DTU, BlendedMVS, and Tanks and Temples datasets to justify the effectiveness of the proposed framework.

Results

TaskDatasetMetricValueModel
3D ReconstructionDTUAcc0.369Vis-MVSNet
3D ReconstructionDTUComp0.361Vis-MVSNet
3D ReconstructionDTUOverall0.365Vis-MVSNet
3DDTUAcc0.369Vis-MVSNet
3DDTUComp0.361Vis-MVSNet
3DDTUOverall0.365Vis-MVSNet
Point CloudsDTUOverall0.365Vis-MVSNet
Point CloudsTanks and TemplesMean F1 (Intermediate)60.03Vis-MVSNet

Related Papers

AutoPartGen: Autogressive 3D Part Generation and Discovery2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15