
PWOC-3D: Deep Occlusion-Aware End-to-End Scene Flow Estimation

Rohan Saxena, René Schuster, Oliver Wasenmüller, Didier Stricker

Date: 2019-04-12
Tasks: Stereo Matching, Stereo Matching Hand, Optical Flow Estimation, Scene Flow Estimation

Abstract

In the last few years, convolutional neural networks (CNNs) have demonstrated increasing success at learning many computer vision tasks including dense estimation problems such as optical flow and stereo matching. However, the joint prediction of these tasks, called scene flow, has traditionally been tackled using slow classical methods based on primitive assumptions which fail to generalize. The work presented in this paper overcomes these drawbacks efficiently (in terms of speed and accuracy) by proposing PWOC-3D, a compact CNN architecture to predict scene flow from stereo image sequences in an end-to-end supervised setting. Further, large motion and occlusions are well-known problems in scene flow estimation. PWOC-3D employs specialized design decisions to explicitly model these challenges. In this regard, we propose a novel self-supervised strategy to predict occlusions from images (learned without any labeled occlusion data). Leveraging several such constructs, our network achieves competitive results on the KITTI benchmark and the challenging FlyingThings3D dataset. Especially on KITTI, PWOC-3D achieves the second place among end-to-end deep learning methods with 48 times fewer parameters than the top-performing method.
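To make the phrase "joint prediction of these tasks" concrete, the sketch below shows one common way to turn optical flow plus two disparity maps into per-pixel 3D motion for a rectified stereo rig. It is not taken from PWOC-3D. The function names and the camera parameters (`f`, `cx`, `cy`, `baseline`) are illustrative assumptions, and `disp1` is assumed to hold the second-frame disparity of each reference-frame pixel.

```python
import numpy as np

def backproject(x, y, disp, f, cx, cy, baseline):
    """Pinhole back-projection of pixels with disparity to 3D points (hypothetical helper)."""
    z = f * baseline / disp                      # depth from disparity
    return np.stack([(x - cx) * z / f, (y - cy) * z / f, z], axis=-1)

def scene_flow_3d(flow, disp0, disp1, f, cx, cy, baseline):
    """3D motion per pixel from optical flow (H, W, 2) and disparities (H, W).

    Assumes disp1 is the disparity at the second time step, stored at the
    pixel's location in the reference (first) frame.
    """
    h, w = disp0.shape
    y, x = np.mgrid[0:h, 0:w].astype(np.float64)
    p0 = backproject(x, y, disp0, f, cx, cy, baseline)
    # The pixel moves by the optical flow in the image plane between frames.
    p1 = backproject(x + flow[..., 0], y + flow[..., 1], disp1, f, cx, cy, baseline)
    return p1 - p0
```

With zero optical flow and a disparity that halves between frames, every point moves away from the camera along its viewing ray, which is why disparity change carries the depth component of the motion.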

Results

Task                   Dataset                     Metric       Value  Model
Scene Flow Estimation  KITTI 2015 Scene Flow Test  D1-all       5.13   PWOC-3D
Scene Flow Estimation  KITTI 2015 Scene Flow Test  D2-all       8.46   PWOC-3D
Scene Flow Estimation  KITTI 2015 Scene Flow Test  Fl-all       12.96  PWOC-3D
Scene Flow Estimation  KITTI 2015 Scene Flow Test  Runtime (s)  0.13   PWOC-3D
Scene Flow Estimation  KITTI 2015 Scene Flow Test  SF-all       15.69  PWOC-3D
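
For readers unfamiliar with the metrics above: D1-all, D2-all, Fl-all and SF-all are outlier percentages on the KITTI 2015 benchmark. A pixel counts as an outlier when its error exceeds both 3 px and 5% of the ground-truth magnitude, and SF-all counts a pixel as wrong if any of the three components is wrong. The following is a minimal sketch of that computation, not the official evaluation code. The function names are illustrative, and a single shared validity mask is a simplifying assumption.

```python
import numpy as np

def outlier_mask(pred, gt, valid, abs_thresh=3.0, rel_thresh=0.05):
    """Valid pixels whose error exceeds both the absolute and relative threshold."""
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    if pred.ndim == 2:                 # disparity maps: add a channel axis
        pred, gt = pred[..., None], gt[..., None]
    err = np.linalg.norm(pred - gt, axis=-1)   # end-point error per pixel
    mag = np.linalg.norm(gt, axis=-1)          # ground-truth magnitude
    return np.asarray(valid, dtype=bool) & (err > abs_thresh) & (err > rel_thresh * mag)

def kitti_outlier_rates(d1_pred, d1_gt, d2_pred, d2_gt, fl_pred, fl_gt, valid):
    """Outlier percentages; assumes one validity mask shared by all ground truths."""
    valid = np.asarray(valid, dtype=bool)
    bad_d1 = outlier_mask(d1_pred, d1_gt, valid)
    bad_d2 = outlier_mask(d2_pred, d2_gt, valid)
    bad_fl = outlier_mask(fl_pred, fl_gt, valid)
    bad_sf = bad_d1 | bad_d2 | bad_fl          # wrong if any component is wrong
    n = valid.sum()
    return {
        "D1-all": 100.0 * bad_d1.sum() / n,
        "D2-all": 100.0 * bad_d2.sum() / n,
        "Fl-all": 100.0 * bad_fl.sum() / n,
        "SF-all": 100.0 * bad_sf.sum() / n,
    }
```

Because SF-all is the union of the three outlier sets, it is always at least as large as each of D1-all, D2-all and Fl-all, which matches the ordering of the values in the table.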
