TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Self-Supervised Monocular Scene Flow Estimation

Self-Supervised Monocular Scene Flow Estimation

Junhwa Hur, Stefan Roth

2020-04-08CVPR 2020 6Optical Flow EstimationSelf-Supervised LearningScene Flow EstimationDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Scene flow estimation has been receiving increasing attention for 3D environment perception. Monocular scene flow estimation -- obtaining 3D structure and 3D motion from two temporally consecutive images -- is a highly ill-posed problem, and practical solutions are lacking to date. We propose a novel monocular scene flow method that yields competitive accuracy and real-time performance. By taking an inverse problem view, we design a single convolutional neural network (CNN) that successfully estimates depth and 3D motion simultaneously from a classical optical flow cost volume. We adopt self-supervised learning with 3D loss functions and occlusion reasoning to leverage unlabeled data. We validate our design choices, including the proxy loss and augmentation setup. Our model achieves state-of-the-art accuracy among unsupervised/self-supervised learning approaches to monocular scene flow, and yields competitive results for the optical flow and monocular depth estimation sub-tasks. Semi-supervised fine-tuning further improves the accuracy and yields promising results in real-time.

Results

TaskDatasetMetricValueModel
Scene Flow EstimationKITTI 2015 Scene Flow TestD1-all34.02Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TestD2-all36.34Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TestFl-all23.54Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TestRuntime (s)0.09Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TestSF-all49.54Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow Training Runtime (s)0.09Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TrainingD1-all31.25Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TrainingD2-all34.86Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TrainingFl-all23.49Self-Mono-SF
Scene Flow EstimationKITTI 2015 Scene Flow TrainingSF-all47.05Self-Mono-SF

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15