TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SceneDreamer: Unbounded 3D Scene Generation from 2D Image ...

SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Zhaoxi Chen, Guangcong Wang, Ziwei Liu

2023-02-02Scene Generation
PaperPDFCode(official)

Abstract

In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noise. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our approach begins with an efficient bird's-eye-view (BEV) representation generated from simplex noise, which includes a height field for surface elevation and a semantic field for detailed scene semantics. This BEV scene representation enables 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Moreover, we propose a novel generative neural hash grid to parameterize the latent space based on 3D positions and scene semantics, aiming to encode generalizable features across various scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.

Results

TaskDatasetMetricValueModel
Scene GenerationGoogleEarthCamera Error0.186SceneDreamer
Scene GenerationGoogleEarthDepth Error0.152SceneDreamer
Scene GenerationGoogleEarthFID213.56SceneDreamer
Scene GenerationGoogleEarthKID0.216SceneDreamer
16kGoogleEarthCamera Error0.186SceneDreamer
16kGoogleEarthDepth Error0.152SceneDreamer
16kGoogleEarthFID213.56SceneDreamer
16kGoogleEarthKID0.216SceneDreamer

Related Papers

World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17$I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting2025-07-12Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures2025-07-07Voyaging into Unbounded Dynamic Scenes from a Single View2025-07-05XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation2025-06-26From 2D to 3D Cognition: A Brief Survey of General World Models2025-06-25WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration2025-06-25DreamAnywhere: Object-Centric Panoramic 3D Scene Generation2025-06-25