TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SIFU: Side-view Conditioned Implicit Function for Real-wor...

SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction

Zechuan Zhang, Zongxin Yang, Yi Yang

2023-12-10CVPR 2024 1Lifelike 3D Human Generation
PaperPDFCode(official)

Abstract

Creating high-quality 3D models of clothed humans from single images for real-world applications is crucial. Despite recent advancements, accurately reconstructing humans in complex poses or with loose clothing from in-the-wild images, along with predicting textures for unseen areas, remains a significant challenge. A key limitation of previous methods is their insufficient prior guidance in transitioning from 2D to 3D and in texture prediction. In response, we introduce SIFU (Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction), a novel approach combining a Side-view Decoupling Transformer with a 3D Consistent Texture Refinement pipeline.SIFU employs a cross-attention mechanism within the transformer, using SMPL-X normals as queries to effectively decouple side-view features in the process of mapping 2D features to 3D. This method not only improves the precision of the 3D models but also their robustness, especially when SMPL-X estimates are not perfect. Our texture refinement process leverages text-to-image diffusion-based prior to generate realistic and consistent textures for invisible views. Through extensive experiments, SIFU surpasses SOTA methods in both geometry and texture reconstruction, showcasing enhanced robustness in complex scenarios and achieving an unprecedented Chamfer and P2S measurement. Our approach extends to practical applications such as 3D printing and scene building, demonstrating its broad utility in real-world scenarios. Project page https://river-zhang.github.io/SIFU-projectpage/ .

Results

TaskDatasetMetricValueModel
Lifelike 3D Human GenerationTHuman2.0 DatasetCLIP Similarity0.8663SIFU
Lifelike 3D Human GenerationTHuman2.0 DatasetLPIPS0.15SIFU
Lifelike 3D Human GenerationTHuman2.0 DatasetPSNR16.46SIFU
Lifelike 3D Human GenerationTHuman2.0 DatasetSSIM0.7931SIFU

Related Papers

Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models2024-09-04Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail2024-03-18SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion2023-11-27PaMIR: Parametric Model-Conditioned Implicit Representation for Image-based Human Reconstruction2020-07-08PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization2019-05-13