TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HorizonNet: Learning Room Layout with 1D Representation an...

HorizonNet: Learning Room Layout with 1D Representation and Pano Stretch Data Augmentation

Cheng Sun, Chi-Wei Hsiao, Min Sun, Hwann-Tzong Chen

2019-01-12CVPR 2019 6Data Augmentation3D Room Layouts From A Single RGB Panorama
PaperPDFCode(official)

Abstract

We present a new approach to the problem of estimating the 3D room layout from a single panoramic image. We represent room layout as three 1D vectors that encode, at each image column, the boundary positions of floor-wall and ceiling-wall, and the existence of wall-wall boundary. The proposed network, HorizonNet, trained for predicting 1D layout, outperforms previous state-of-the-art approaches. The designed post-processing procedure for recovering 3D room layouts from 1D predictions can automatically infer the room shape with low computation cost - it takes less than 20ms for a panorama image while prior works might need dozens of seconds. We also propose Pano Stretch Data Augmentation, which can diversify panorama data and be applied to other panorama-related learning tasks. Due to the limited data available for non-cuboid layout, we relabel 65 general layout from the current dataset for finetuning. Our approach shows good performance on general layouts by qualitative results and cross-validation.

Results

TaskDatasetMetricValueModel
3D ReconstructionStanford2D3D Panoramic3DIoU79.79HorizonNet
3D ReconstructionStanford2D3D PanoramicCorner Error0.71HorizonNet
3D ReconstructionStanford2D3D PanoramicPixel Error2.39HorizonNet
3D ReconstructionPanoContext3DIoU82.17HorizonNet
Scene ParsingStanford2D3D Panoramic3DIoU79.79HorizonNet
Scene ParsingStanford2D3D PanoramicCorner Error0.71HorizonNet
Scene ParsingStanford2D3D PanoramicPixel Error2.39HorizonNet
Scene ParsingPanoContext3DIoU82.17HorizonNet
3DStanford2D3D Panoramic3DIoU79.79HorizonNet
3DStanford2D3D PanoramicCorner Error0.71HorizonNet
3DStanford2D3D PanoramicPixel Error2.39HorizonNet
3DPanoContext3DIoU82.17HorizonNet
Scene UnderstandingStanford2D3D Panoramic3DIoU79.79HorizonNet
Scene UnderstandingStanford2D3D PanoramicCorner Error0.71HorizonNet
Scene UnderstandingStanford2D3D PanoramicPixel Error2.39HorizonNet
Scene UnderstandingPanoContext3DIoU82.17HorizonNet
2D Semantic SegmentationStanford2D3D Panoramic3DIoU79.79HorizonNet
2D Semantic SegmentationStanford2D3D PanoramicCorner Error0.71HorizonNet
2D Semantic SegmentationStanford2D3D PanoramicPixel Error2.39HorizonNet
2D Semantic SegmentationPanoContext3DIoU82.17HorizonNet

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15Iceberg: Enhancing HLS Modeling with Synthetic Data2025-07-14AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs)2025-07-13FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation2025-07-11DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data Augmentation2025-07-08