TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HITNet: Hierarchical Iterative Tile Refinement Network for...

HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching

Vladimir Tankovich, Christian Häne, yinda zhang, Adarsh Kowdle, Sean Fanello, Sofien Bouaziz

2020-07-23CVPR 2021 1Stereo MatchingStereo Depth EstimationStereo Disparity Estimation
PaperPDFCodeCodeCodeCodeCodeCodeCode(official)Code(official)Code

Abstract

This paper presents HITNet, a novel neural network architecture for real-time stereo matching. Contrary to many recent neural network approaches that operate on a full cost volume and rely on 3D convolutions, our approach does not explicitly build a volume and instead relies on a fast multi-resolution initialization step, differentiable 2D geometric propagation and warping mechanisms to infer disparity hypotheses. To achieve a high level of accuracy, our network not only geometrically reasons about disparities but also infers slanted plane hypotheses allowing to more accurately perform geometric warping and upsampling operations. Our architecture is inherently multi-resolution allowing the propagation of information across different levels. Multiple experiments prove the effectiveness of the proposed approach at a fraction of the computation required by state-of-the-art methods. At the time of writing, HITNet ranks 1st-3rd on all the metrics published on the ETH3D website for two view stereo, ranks 1st on most of the metrics among all the end-to-end learning approaches on Middlebury-v3, ranks 1st on the popular KITTI 2012 and 2015 benchmarks among the published methods faster than 100ms.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI2015three pixel error2.43HITNET
3DKITTI2015three pixel error2.43HITNET
Stereo Disparity EstimationScene FlowEPE0.529HITNet
Stereo Disparity EstimationScene Flowone pixel error5.52HITNet
Stereo Disparity EstimationScene Flowthree pixel error3HITNet
Stereo Disparity EstimationScene FlowEPE0.43HITNet L
Stereo Disparity EstimationScene Flowone pixel error4.7HITNet L
Stereo Disparity EstimationScene Flowthree pixel error2.57HITNet L
Stereo Disparity EstimationScene FlowEPE0.36HITNet XL
Stereo Disparity EstimationScene Flowone pixel error4.09HITNet XL
Stereo Disparity EstimationScene Flowthree pixel error2.21HITNet XL
Stereo Depth EstimationKITTI2015three pixel error2.43HITNET

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17Cameras as Relative Positional Encoding2025-07-14Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts2025-07-07RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather2025-07-02ESMStereo: Enhanced ShuffleMixer Disparity Upsampling for Real-Time and Accurate Stereo Matching2025-06-26StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation2025-06-25DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning2025-06-17SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams2025-05-26