TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Floors are Flat: Leveraging Semantics for Real-Time Surfac...

Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction

Steven Hickson, Karthik Raveendran, Alireza Fathi, Kevin Murphy, Irfan Essa

2019-06-16Surface Normals EstimationSemantic Segmentation
PaperPDFCode(official)

Abstract

We propose 4 insights that help to significantly improve the performance of deep learning models that predict surface normals and semantic labels from a single RGB image. These insights are: (1) denoise the "ground truth" surface normals in the training set to ensure consistency with the semantic labels; (2) concurrently train on a mix of real and synthetic data, instead of pretraining on synthetic and finetuning on real; (3) jointly predict normals and semantics using a shared model, but only backpropagate errors on pixels that have valid training labels; (4) slim down the model and use grayscale instead of color inputs. Despite the simplicity of these steps, we demonstrate consistently improved results on several datasets, using a model that runs at 12 fps on a standard mobile phone.

Results

TaskDatasetMetricValueModel
Semantic SegmentationScanNetV2Pixel Accuracy65.6Floors are Flat
Surface Normals EstimationScanNetV2% < 11.2550.9Floors are Flat
Surface Normals EstimationScanNetV2% < 22.565.2Floors are Flat
Surface Normals EstimationScanNetV2% < 3070Floors are Flat
Surface Normals EstimationScanNetV2Mean Angle Error28Floors are Flat
Surface Normals EstimationNYU Depth v2% < 11.2559.5Floors are Flat
Surface Normals EstimationNYU Depth v2% < 22.572.2Floors are Flat
Surface Normals EstimationNYU Depth v2% < 3077.3Floors are Flat
Surface Normals EstimationNYU Depth v2Mean Angle Error19.7Floors are Flat
Surface Normals EstimationNYU Depth v2RMSE19.3Floors are Flat
10-shot image generationScanNetV2Pixel Accuracy65.6Floors are Flat

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15