ATISS: Autoregressive Transformers for Indoor Scene Synthesis

Despoina Paschalidou, Amlan Kar, Maria Shugrina, Karsten Kreis, Andreas Geiger, Sanja Fidler

2021-10-07 | NeurIPS 2021 12 | 2D Semantic Segmentation task 1 (8 classes) | Indoor Scene Synthesis | 3D Semantic Scene Completion

Abstract

The ability to synthesize realistic and diverse indoor furniture layouts automatically or based on partial input unlocks many applications, from better interactive 3D tools to data synthesis for training and simulation. In this paper, we present ATISS, a novel autoregressive transformer architecture for creating diverse and plausible synthetic indoor environments, given only the room type and its floor plan. In contrast to prior work, which poses scene synthesis as sequence generation, our model generates rooms as unordered sets of objects. We argue that this formulation is more natural, as it makes ATISS generally useful beyond fully automatic room layout synthesis. For example, the same trained model can be used in interactive applications for general scene completion, partial room re-arrangement with any objects specified by the user, as well as object suggestions for any partial room. To enable this, our model leverages the permutation equivariance of the transformer when conditioning on the partial scene, and is trained to be permutation-invariant across object orderings. Our model is trained end-to-end as an autoregressive generative model using only labeled 3D bounding boxes as supervision. Evaluations on four room types in the 3D-FRONT dataset demonstrate that our model consistently generates plausible room layouts that are more realistic than those of existing methods. In addition, it has fewer parameters, is simpler to implement and train, and runs up to 8 times faster than existing methods.
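The permutation property mentioned in the abstract can be illustrated with a toy example. The sketch below is not the ATISS implementation: it assumes a single attention head with identity projections, no positional encoding, and random 4-dimensional vectors standing in for object features. The helper names (`self_attention`, `mean_pool`, `close`) are hypothetical. It checks that reordering the input objects reorders the attention outputs in the same way (equivariance), and that a mean-pooled summary of those outputs is unchanged (invariance).

```python
import math
import random


def self_attention(tokens):
    """Single-head dot-product self-attention.

    Simplifying assumptions: identity query/key/value projections and
    no positional encoding, so the result depends only on the set of tokens.
    """
    dim = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in tokens]
        peak = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * v[d] for w, v in zip(weights, tokens)) / total
            for d in range(dim)
        ])
    return out


def mean_pool(rows):
    """Average the rows into one vector (an order-independent summary)."""
    return [sum(col) / len(rows) for col in zip(*rows)]


def close(u, v, tol=1e-9):
    """Element-wise comparison that tolerates floating-point reordering."""
    return all(math.isclose(a, b, abs_tol=tol) for a, b in zip(u, v))


rng = random.Random(0)
# Hypothetical object features: 5 objects, 4 numbers each.
objects = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(5)]

perm = list(range(len(objects)))
rng.shuffle(perm)
shuffled = [objects[i] for i in perm]  # shuffled[j] == objects[perm[j]]

out = self_attention(objects)
out_shuffled = self_attention(shuffled)

# Equivariance: output j of the shuffled input matches output perm[j] of the original.
equivariant = all(close(out[i], out_shuffled[j]) for j, i in enumerate(perm))
# Invariance: pooling over the outputs removes the dependence on order.
invariant = close(mean_pool(out), mean_pool(out_shuffled))
```

Adding a positional encoding to each token would break both checks, which is one reason a set-based formulation avoids imposing an order on the conditioning objects.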

Results

Task	Dataset	Metric	Value	Model
3D Reconstruction	PRO-teXt	CD	2.0756	ATISS
3D Reconstruction	PRO-teXt	EMD	1.414	ATISS
3D Reconstruction	PRO-teXt	F1	0.0663	ATISS
Scene Parsing	PRO-teXt	CD	2.0756	ATISS
Scene Parsing	PRO-teXt	EMD	1.414	ATISS
Scene Parsing	PRO-teXt	F1	0.0663	ATISS
3D	PRO-teXt	CD	2.0756	ATISS
3D	PRO-teXt	EMD	1.414	ATISS
3D	PRO-teXt	F1	0.0663	ATISS
2D Semantic Segmentation	PRO-teXt	CD	2.0756	ATISS
2D Semantic Segmentation	PRO-teXt	EMD	1.414	ATISS
2D Semantic Segmentation	PRO-teXt	F1	0.0663	ATISS
3D Semantic Scene Completion	PRO-teXt	CD	2.0756	ATISS
3D Semantic Scene Completion	PRO-teXt	EMD	1.414	ATISS
3D Semantic Scene Completion	PRO-teXt	F1	0.0663	ATISS
