TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Sem2NeRF: Converting Single-View Semantic Masks to Neural ...

Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields

Yuedong Chen, Qianyi Wu, Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai

2022-03-21Translation3D-Aware Image Synthesis
PaperPDFCode(official)

Abstract

Image translation and manipulation have gain increasing attention along with the rapid development of deep generative models. Although existing approaches have brought impressive results, they mainly operated in 2D space. In light of recent advances in NeRF-based 3D-aware generative models, we introduce a new task, Semantic-to-NeRF translation, that aims to reconstruct a 3D scene modelled by NeRF, conditioned on one single-view semantic mask as input. To kick-off this novel task, we propose the Sem2NeRF framework. In particular, Sem2NeRF addresses the highly challenging task by encoding the semantic mask into the latent code that controls the 3D scene representation of a pre-trained decoder. To further improve the accuracy of the mapping, we integrate a new region-aware learning strategy into the design of both the encoder and the decoder. We verify the efficacy of the proposed Sem2NeRF and demonstrate that it outperforms several strong baselines on two benchmark datasets. Code and video are available at https://donydchen.github.io/sem2nerf/

Results

TaskDatasetMetricValueModel
Image GenerationCelebAMask-HQFID41.52Sem2NeRF
Image GenerationCelebAMask-HQIS2.03Sem2NeRF
Image GenerationCelebAMask-HQFID55.56pSp
Image GenerationCelebAMask-HQIS1.74pSp
Image GenerationCelebAMask-HQFID67.32pix2pixHD
Image GenerationCelebAMask-HQIS1.72pix2pixHD
3DCelebAMask-HQFID41.52Sem2NeRF
3DCelebAMask-HQIS2.03Sem2NeRF
3DCelebAMask-HQFID55.56pSp
3DCelebAMask-HQIS1.74pSp
3DCelebAMask-HQFID67.32pix2pixHD
3DCelebAMask-HQIS1.72pix2pixHD

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Function-to-Style Guidance of LLMs for Code Translation2025-07-15Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings2025-07-09Unconditional Diffusion for Generative Sequential Recommendation2025-07-08GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation2025-07-04TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation2025-07-01CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation2025-06-29