TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Specifying Object Attributes and Relations in Interactive ...

Specifying Object Attributes and Relations in Interactive Scene Generation

Oron Ashual, Lior Wolf

2019-09-11ICCV 2019 10Scene GenerationLayout-to-Image Generation
PaperPDFCodeCode(official)

Abstract

We introduce a method for the generation of images from an input scene graph. The method separates between a layout embedding and an appearance embedding. The dual embedding leads to generated images that better match the scene graph, have higher visual quality, and support more complex scene graphs. In addition, the embedding scheme supports multiple and diverse output images per scene graph, which can be further controlled by the user. We demonstrate two modes of per-object control: (i) importing elements from other images, and (ii) navigation in the object space, by selecting an appearance archetype. Our code is publicly available at https://www.github.com/ashual/scene_generation

Results

TaskDatasetMetricValueModel
Image GenerationCOCO-Stuff 128x128FID59.5SOARISG
Image GenerationCOCO-Stuff 128x128Inception Score12.5SOARISG
Image GenerationCOCO-Stuff 128x128SceneFID33.46SOARISG
Image GenerationCOCO-Stuff 64x64FID48.7SOARISG
Image GenerationCOCO-Stuff 64x64Inception Score10.3SOARISG

Related Papers

World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17$I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting2025-07-12Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures2025-07-07Voyaging into Unbounded Dynamic Scenes from a Single View2025-07-05XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation2025-06-26From 2D to 3D Cognition: A Brief Survey of General World Models2025-06-25WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration2025-06-25DreamAnywhere: Object-Centric Panoramic 3D Scene Generation2025-06-25