TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CityDreamer: Compositional Generative Model of Unbounded 3...

CityDreamer: Compositional Generative Model of Unbounded 3D Cities

Haozhe Xie, Zhaoxi Chen, Fangzhou Hong, Ziwei Liu

2023-09-01CVPR 2024 1Scene Generation
PaperPDFCode(official)

Abstract

3D city generation is a desirable yet challenging task, since humans are more sensitive to structural distortions in urban environments. Additionally, generating 3D cities is more complex than 3D natural scenes since buildings, as objects of the same class, exhibit a wider range of appearances compared to the relatively consistent appearance of objects like trees in natural scenes. To address these challenges, we propose \textbf{CityDreamer}, a compositional generative model designed specifically for unbounded 3D cities. Our key insight is that 3D city generation should be a composition of different types of neural fields: 1) various building instances, and 2) background stuff, such as roads and green lands. Specifically, we adopt the bird's eye view scene representation and employ a volumetric render for both instance-oriented and stuff-oriented neural fields. The generative hash grid and periodic positional embedding are tailored as scene parameterization to suit the distinct characteristics of building instances and background stuff. Furthermore, we contribute a suite of CityGen Datasets, including OSM and GoogleEarth, which comprises a vast amount of real-world city imagery to enhance the realism of the generated 3D cities both in their layouts and appearances. CityDreamer achieves state-of-the-art performance not only in generating realistic 3D cities but also in localized editing within the generated cities.

Results

TaskDatasetMetricValueModel
Scene GenerationGoogleEarthCamera Error0.06CityDreamer
Scene GenerationGoogleEarthDepth Error0.147CityDreamer
Scene GenerationGoogleEarthFID97.38CityDreamer
Scene GenerationGoogleEarthKID0.096CityDreamer
16kGoogleEarthCamera Error0.06CityDreamer
16kGoogleEarthDepth Error0.147CityDreamer
16kGoogleEarthFID97.38CityDreamer
16kGoogleEarthKID0.096CityDreamer

Related Papers

World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17$I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting2025-07-12Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures2025-07-07Voyaging into Unbounded Dynamic Scenes from a Single View2025-07-05XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation2025-06-26From 2D to 3D Cognition: A Brief Survey of General World Models2025-06-25WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration2025-06-25DreamAnywhere: Object-Centric Panoramic 3D Scene Generation2025-06-25