Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem
We propose an algorithm to predict room layout from a single image that generalizes across panoramas and perspective images, and across cuboid layouts and more general layouts (e.g., L-shaped rooms). Our method operates directly on the panoramic image, rather than decomposing it into perspective images as recent works do. Our network architecture is similar to that of RoomNet, but we show improvements due to aligning the image based on vanishing points, predicting multiple layout elements (corners, boundaries, size and translation), and fitting a constrained Manhattan layout to the resulting predictions. Our method compares well in speed and accuracy with other existing work on panoramas, achieves among the best accuracy for perspective images, and can handle both cuboid-shaped and more general Manhattan layouts.
| Task | Dataset | Metric | Value (%) | Model |
|---|---|---|---|---|
| 3D Reconstruction | Stanford2D3D Panoramic | 3DIoU | 76.33 | LayoutNet |
| 3D Reconstruction | Stanford2D3D Panoramic | Corner Error | 1.04 | LayoutNet |
| 3D Reconstruction | Stanford2D3D Panoramic | Pixel Error | 2.7 | LayoutNet |
| 3D Reconstruction | PanoContext | 3DIoU | 74.48 | LayoutNet |
| Scene Parsing | Stanford2D3D Panoramic | 3DIoU | 76.33 | LayoutNet |
| Scene Parsing | Stanford2D3D Panoramic | Corner Error | 1.04 | LayoutNet |
| Scene Parsing | Stanford2D3D Panoramic | Pixel Error | 2.7 | LayoutNet |
| Scene Parsing | PanoContext | 3DIoU | 74.48 | LayoutNet |
| 3D | Stanford2D3D Panoramic | 3DIoU | 76.33 | LayoutNet |
| 3D | Stanford2D3D Panoramic | Corner Error | 1.04 | LayoutNet |
| 3D | Stanford2D3D Panoramic | Pixel Error | 2.7 | LayoutNet |
| 3D | PanoContext | 3DIoU | 74.48 | LayoutNet |
| Scene Understanding | Stanford2D3D Panoramic | 3DIoU | 76.33 | LayoutNet |
| Scene Understanding | Stanford2D3D Panoramic | Corner Error | 1.04 | LayoutNet |
| Scene Understanding | Stanford2D3D Panoramic | Pixel Error | 2.7 | LayoutNet |
| Scene Understanding | PanoContext | 3DIoU | 74.48 | LayoutNet |
| 2D Semantic Segmentation | Stanford2D3D Panoramic | 3DIoU | 76.33 | LayoutNet |
| 2D Semantic Segmentation | Stanford2D3D Panoramic | Corner Error | 1.04 | LayoutNet |
| 2D Semantic Segmentation | Stanford2D3D Panoramic | Pixel Error | 2.7 | LayoutNet |
| 2D Semantic Segmentation | PanoContext | 3DIoU | 74.48 | LayoutNet |
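
To give a rough sense of what the 3DIoU metric in the table measures, the following is a minimal sketch of intersection over union for two axis-aligned cuboids. It is an illustration only, not the evaluation code behind the reported numbers: the function name `cuboid_iou_3d` and the `(xmin, ymin, zmin, xmax, ymax, zmax)` box format are assumptions made for this example, and a full evaluation would also need to handle rotated and non-cuboid Manhattan layouts.

```python
from typing import Sequence


def cuboid_iou_3d(box_a: Sequence[float], box_b: Sequence[float]) -> float:
    """Intersection over union of two axis-aligned cuboids.

    Each box is (xmin, ymin, zmin, xmax, ymax, zmax). This format is a
    hypothetical convention chosen for the sketch, not one taken from the paper.
    """
    inter = 1.0
    vol_a = 1.0
    vol_b = 1.0
    for axis in range(3):
        # Overlap along this axis; clamp to zero when the boxes are disjoint.
        lo = max(box_a[axis], box_b[axis])
        hi = min(box_a[axis + 3], box_b[axis + 3])
        inter *= max(0.0, hi - lo)
        # Accumulate each box's own volume, one side length at a time.
        vol_a *= box_a[axis + 3] - box_a[axis]
        vol_b *= box_b[axis + 3] - box_b[axis]
    union = vol_a + vol_b - inter
    return inter / union if union > 0 else 0.0


# Two 2 x 2 x 2 rooms offset by 1 along x:
# overlap volume is 4, union volume is 8 + 8 - 4 = 12.
predicted = (0.0, 0.0, 0.0, 2.0, 2.0, 2.0)
ground_truth = (1.0, 0.0, 0.0, 3.0, 2.0, 2.0)
iou = cuboid_iou_3d(predicted, ground_truth)
```

Multiplying the result by 100 gives a percentage on the same scale as the 3DIoU values above.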