Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Magic3D: High-Resolution Text-to-3D Content Creation

Chen-Hsuan Lin, Jun Gao, Luming Tang, Towaki Takikawa, Xiaohui Zeng, Xun Huang, Karsten Kreis, Sanja Fidler, Ming-Yu Liu, Tsung-Yi Lin

2022-11-18 | CVPR 2023 | Vocal Bursts Intensity Prediction | Text to 3D

Abstract

DreamFusion has recently demonstrated the utility of a pre-trained text-to-image diffusion model to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis results. However, the method has two inherent limitations: (a) extremely slow optimization of NeRF and (b) low-resolution image space supervision on NeRF, leading to low-quality 3D models with a long processing time. In this paper, we address these limitations by utilizing a two-stage optimization framework. First, we obtain a coarse model using a low-resolution diffusion prior and accelerate it with a sparse 3D hash grid structure. Using the coarse representation as the initialization, we further optimize a textured 3D mesh model with an efficient differentiable renderer interacting with a high-resolution latent diffusion model. Our method, dubbed Magic3D, can create high-quality 3D mesh models in 40 minutes, which is 2x faster than DreamFusion (reportedly taking 1.5 hours on average), while also achieving higher resolution. User studies show that 61.7% of raters prefer our approach over DreamFusion. Together with the image-conditioned generation capabilities, we provide users with new ways to control 3D synthesis, opening up new avenues to various creative applications.
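The "2x faster" figure in the abstract can be checked from the two reported timings. The short sketch below only redoes that arithmetic; the constant and function names are illustrative and do not come from the paper or its code.

```python
# Timings as quoted in the abstract above (not measured here).
MAGIC3D_MINUTES = 40        # reported Magic3D generation time
DREAMFUSION_HOURS = 1.5     # reported DreamFusion average time


def speedup(baseline_minutes: float, new_minutes: float) -> float:
    """Return how many times faster the new method is than the baseline."""
    if new_minutes <= 0:
        raise ValueError("new_minutes must be positive")
    return baseline_minutes / new_minutes


# Convert the DreamFusion figure to minutes so both values share a unit.
dreamfusion_minutes = DREAMFUSION_HOURS * 60
ratio = speedup(dreamfusion_minutes, MAGIC3D_MINUTES)

print(f"DreamFusion: {dreamfusion_minutes:.0f} min, Magic3D: {MAGIC3D_MINUTES} min")
print(f"Speedup: {ratio:.2f}x")  # prints "Speedup: 2.25x"
```

On the quoted numbers the ratio is 2.25, which the abstract rounds to 2x.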

Results

Task                      Dataset     Metric  Value  Model
3D                        T$^3$Bench  Avg     32.7   Magic3D
Text to Image Generation  T$^3$Bench  Avg     32.7   Magic3D
Text to 3D                T$^3$Bench  Avg     32.7   Magic3D

Related Papers

2025-07-07  Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures
2025-06-25  DreamAnywhere: Object-Centric Panoramic 3D Scene Generation
2025-06-16  Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching
2025-06-12  EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence
2025-06-11  DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision
2025-06-09  R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation
2025-06-05  AI-powered Contextual 3D Environment Generation: A Systematic Review
2025-05-31  ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary