Deep Cascaded Bi-Network for Face Hallucination

Shizhan Zhu, Sifei Liu, Chen Change Loy, Xiaoou Tang

2016-07-18Face Hallucination Hallucination

Abstract

We present a novel framework for hallucinating faces of unconstrained poses and with very low resolution (face size as small as 5pxIOD). In contrast to existing studies that mostly ignore or assume pre-aligned face spatial configuration (e.g. facial landmarks localization or dense correspondence field), we alternatingly optimize two complementary tasks, namely face hallucination and dense correspondence field estimation, in a unified framework. In addition, we propose a new gated deep bi-network that contains two functionality-specialized branches to recover different levels of texture details. Extensive experiments demonstrate that such formulation allows exceptional hallucination quality on in-the-wild low-res faces with significant pose and illumination variations.

Results

Task	Dataset	Metric	Value	Model
Super-Resolution	WebFace - 8x upscaling	PSNR	23.1	CBN
Super-Resolution	VggFace2 - 8x upscaling	PSNR	21.84	CBN
Image Super-Resolution	WebFace - 8x upscaling	PSNR	23.1	CBN
Image Super-Resolution	VggFace2 - 8x upscaling	PSNR	21.84	CBN
3D Object Super-Resolution	WebFace - 8x upscaling	PSNR	23.1	CBN
3D Object Super-Resolution	VggFace2 - 8x upscaling	PSNR	21.84	CBN
16k	WebFace - 8x upscaling	PSNR	23.1	CBN
16k	VggFace2 - 8x upscaling	PSNR	21.84	CBN

Related Papers

Mitigating Object Hallucinations via Sentence-Level Early Intervention2025-07-16 ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way2025-07-11 UQLM: A Python Package for Uncertainty Quantification in Large Language Models2025-07-08 DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning2025-07-07 ReLoop: "Seeing Twice and Thinking Backwards" via Closed-loop Training to Mitigate Hallucinations in Multimodal understanding2025-07-07 The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems2025-07-02 GAF-Guard: An Agentic Framework for Risk Management and Governance in Large Language Models2025-07-01 Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration2025-06-26