TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ATISS: Autoregressive Transformers for Indoor Scene Synthe...

ATISS: Autoregressive Transformers for Indoor Scene Synthesis

Despoina Paschalidou, Amlan Kar, Maria Shugrina, Karsten Kreis, Andreas Geiger, Sanja Fidler

2021-10-07NeurIPS 2021 122D Semantic Segmentation task 1 (8 classes)Indoor Scene Synthesis3D Semantic Scene Completion
PaperPDFCode(official)

Abstract

The ability to synthesize realistic and diverse indoor furniture layouts automatically or based on partial input, unlocks many applications, from better interactive 3D tools to data synthesis for training and simulation. In this paper, we present ATISS, a novel autoregressive transformer architecture for creating diverse and plausible synthetic indoor environments, given only the room type and its floor plan. In contrast to prior work, which poses scene synthesis as sequence generation, our model generates rooms as unordered sets of objects. We argue that this formulation is more natural, as it makes ATISS generally useful beyond fully automatic room layout synthesis. For example, the same trained model can be used in interactive applications for general scene completion, partial room re-arrangement with any objects specified by the user, as well as object suggestions for any partial room. To enable this, our model leverages the permutation equivariance of the transformer when conditioning on the partial scene, and is trained to be permutation-invariant across object orderings. Our model is trained end-to-end as an autoregressive generative model using only labeled 3D bounding boxes as supervision. Evaluations on four room types in the 3D-FRONT dataset demonstrate that our model consistently generates plausible room layouts that are more realistic than existing methods. In addition, it has fewer parameters, is simpler to implement and train and runs up to 8 times faster than existing methods.

Results

TaskDatasetMetricValueModel
3D ReconstructionPRO-teXtCD2.0756ATISS
3D ReconstructionPRO-teXtCMD1.414ATISS
3D ReconstructionPRO-teXtF10.0663ATISS
Scene ParsingPRO-teXtCD2.0756ATISS
Scene ParsingPRO-teXtEMD1.414ATISS
Scene ParsingPRO-teXtF10.0663ATISS
3DPRO-teXtCD2.0756ATISS
3DPRO-teXtCMD1.414ATISS
3DPRO-teXtF10.0663ATISS
2D Semantic SegmentationPRO-teXtCD2.0756ATISS
2D Semantic SegmentationPRO-teXtEMD1.414ATISS
2D Semantic SegmentationPRO-teXtF10.0663ATISS
3D Semantic Scene CompletionPRO-teXtCD2.0756ATISS
3D Semantic Scene CompletionPRO-teXtCMD1.414ATISS
3D Semantic Scene CompletionPRO-teXtF10.0663ATISS

Related Papers

Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion2025-07-11Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning2025-06-05ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment2025-06-03FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts2025-06-03Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes2025-05-14CasaGPT: Cuboid Arrangement and Scene Assembly for Interior Design2025-04-28SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion2025-03-21SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis2025-03-18