TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Channel-Wise Attention-Based Network for Self-Supervised M...

Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation

Jiaxing Yan, Hong Zhao, Penghui Bu, YuSheng Jin

2021-12-24Self-Supervised LearningScene UnderstandingDepth PredictionDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Self-supervised learning has shown very promising results for monocular depth estimation. Scene structure and local details both are significant clues for high-quality depth estimation. Recent works suffer from the lack of explicit modeling of scene structure and proper handling of details information, which leads to a performance bottleneck and blurry artefacts in predicted results. In this paper, we propose the Channel-wise Attention-based Depth Estimation Network (CADepth-Net) with two effective contributions: 1) The structure perception module employs the self-attention mechanism to capture long-range dependencies and aggregates discriminative features in channel dimensions, explicitly enhances the perception of scene structure, obtains the better scene understanding and rich feature representation. 2) The detail emphasis module re-calibrates channel-wise feature maps and selectively emphasizes the informative features, aiming to highlight crucial local details information and fuse different level features more efficiently, resulting in more precise and sharper depth prediction. Furthermore, the extensive experiments validate the effectiveness of our method and show that our model achieves the state-of-the-art results on the KITTI benchmark and Make3D datasets.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen split unsupervisedRMSE4.264CADepth-Net (MS+1024x320)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.173CADepth-Net (MS+1024x320)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.694CADepth-Net (MS+1024x320)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.096CADepth-Net (MS+1024x320)
3DKITTI Eigen split unsupervisedRMSE4.264CADepth-Net (MS+1024x320)
3DKITTI Eigen split unsupervisedRMSE log0.173CADepth-Net (MS+1024x320)
3DKITTI Eigen split unsupervisedSq Rel0.694CADepth-Net (MS+1024x320)
3DKITTI Eigen split unsupervisedabsolute relative error0.096CADepth-Net (MS+1024x320)

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16