TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Cost Volume Pyramid Network with Multi-strategies Range Se...

Cost Volume Pyramid Network with Multi-strategies Range Searching for Multi-view Stereo

Shiyu Gao, Zhaoxin Li, Zhaoqi Wang

2022-07-25Stereo Matching3D ReconstructionDepth Estimation
PaperPDFCode(official)

Abstract

Multi-view stereo is an important research task in computer vision while still keeping challenging. In recent years, deep learning-based methods have shown superior performance on this task. Cost volume pyramid network-based methods which progressively refine depth map in coarse-to-fine manner, have yielded promising results while consuming less memory. However, these methods fail to take fully consideration of the characteristics of the cost volumes in each stage, leading to adopt similar range search strategies for each cost volume stage. In this work, we present a novel cost volume pyramid based network with different searching strategies for multi-view stereo. By choosing different depth range sampling strategies and applying adaptive unimodal filtering, we are able to obtain more accurate depth estimation in low resolution stages and iteratively upsample depth map to arbitrary resolution. We conducted extensive experiments on both DTU and BlendedMVS datasets, and results show that our method outperforms most state-of-the-art methods.

Results

TaskDatasetMetricValueModel
3D ReconstructionDTUAcc0.379MSCVP-MVSNet
3D ReconstructionDTUComp0.278MSCVP-MVSNet
3D ReconstructionDTUOverall0.328MSCVP-MVSNet
3DDTUAcc0.379MSCVP-MVSNet
3DDTUComp0.278MSCVP-MVSNet
3DDTUOverall0.328MSCVP-MVSNet

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17AutoPartGen: Autogressive 3D Part Generation and Discovery2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15