TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/AANet: Adaptive Aggregation Network for Efficient Stereo M...

AANet: Adaptive Aggregation Network for Efficient Stereo Matching

Haofei Xu, Juyong Zhang

2020-04-20CVPR 2020 6Stereo MatchingStereo Disparity EstimationScene Flow Estimation
PaperPDFCode(official)

Abstract

Despite the remarkable progress made by learning based stereo matching algorithms, one key challenge remains unsolved. Current state-of-the-art stereo models are mostly based on costly 3D convolutions, the cubic computational complexity and high memory consumption make it quite expensive to deploy in real-world applications. In this paper, we aim at completely replacing the commonly used 3D convolutions to achieve fast inference speed while maintaining comparable accuracy. To this end, we first propose a sparse points based intra-scale cost aggregation method to alleviate the well-known edge-fattening issue at disparity discontinuities. Further, we approximate traditional cross-scale cost aggregation algorithm with neural network layers to handle large textureless regions. Both modules are simple, lightweight, and complementary, leading to an effective and efficient architecture for cost aggregation. With these two modules, we can not only significantly speed up existing top-performing models (e.g., $41\times$ than GC-Net, $4\times$ than PSMNet and $38\times$ than GA-Net), but also improve the performance of fast stereo models (e.g., StereoNet). We also achieve competitive results on Scene Flow and KITTI datasets while running at 62ms, demonstrating the versatility and high efficiency of the proposed method. Our full framework is available at https://github.com/haofeixu/aanet .

Results

TaskDatasetMetricValueModel
Scene Flow EstimationScene FlowEPE0.068AANet
Stereo Disparity EstimationScene FlowEPE0.87AANet
Stereo Disparity EstimationScene Flowone pixel error9.3AANet
Stereo Disparity EstimationScene FlowEPE0.72AANet+
Stereo Disparity EstimationScene Flowone pixel error7.4AANet+

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second2025-07-14Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts2025-07-07RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather2025-07-02ESMStereo: Enhanced ShuffleMixer Disparity Upsampling for Real-Time and Accurate Stereo Matching2025-06-26StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation2025-06-25DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning2025-06-17KDMOS:Knowledge Distillation for Motion Segmentation2025-06-17