TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MIME: Human-Aware 3D Scene Generation

MIME: Human-Aware 3D Scene Generation

Hongwei Yi, Chun-Hao P. Huang, Shashank Tripathi, Lea Hering, Justus Thies, Michael J. Black

2022-12-08CVPR 2023 12D Semantic Segmentation task 1 (8 classes)Indoor Scene SynthesisScene Generation3D Semantic Scene Completion
PaperPDF

Abstract

Generating realistic 3D worlds occupied by moving humans has many applications in games, architecture, and synthetic data creation. But generating such scenes is expensive and labor intensive. Recent work generates human poses and motions given a 3D scene. Here, we take the opposite approach and generate 3D indoor scenes given 3D human motion. Such motions can come from archival motion capture or from IMU sensors worn on the body, effectively turning human movement in a "scanner" of the 3D world. Intuitively, human movement indicates the free-space in a room and human contact indicates surfaces or objects that support activities such as sitting, lying or touching. We propose MIME (Mining Interaction and Movement to infer 3D Environments), which is a generative model of indoor scenes that produces furniture layouts that are consistent with the human movement. MIME uses an auto-regressive transformer architecture that takes the already generated objects in the scene as well as the human motion as input, and outputs the next plausible object. To train MIME, we build a dataset by populating the 3D FRONT scene dataset with 3D humans. Our experiments show that MIME produces more diverse and plausible 3D scenes than a recent generative scene method that does not know about human movement. Code and data will be available for research at https://mime.is.tue.mpg.de.

Results

TaskDatasetMetricValueModel
3D ReconstructionPRO-teXtCD2.0493MIME
3D ReconstructionPRO-teXtCMD1.3832MIME
3D ReconstructionPRO-teXtF10.099MIME
Scene ParsingPRO-teXtCD2.0493MIME
Scene ParsingPRO-teXtEMD1.3832MIME
Scene ParsingPRO-teXtF10.099MIME
3DPRO-teXtCD2.0493MIME
3DPRO-teXtCMD1.3832MIME
3DPRO-teXtF10.099MIME
2D Semantic SegmentationPRO-teXtCD2.0493MIME
2D Semantic SegmentationPRO-teXtEMD1.3832MIME
2D Semantic SegmentationPRO-teXtF10.099MIME
3D Semantic Scene CompletionPRO-teXtCD2.0493MIME
3D Semantic Scene CompletionPRO-teXtCMD1.3832MIME
3D Semantic Scene CompletionPRO-teXtF10.099MIME

Related Papers

World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17$I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting2025-07-12Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion2025-07-11Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures2025-07-07Voyaging into Unbounded Dynamic Scenes from a Single View2025-07-05XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation2025-06-26From 2D to 3D Cognition: A Brief Survey of General World Models2025-06-25WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration2025-06-25