TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dual-Branch Network for Portrait Image Quality Assessment

Dual-Branch Network for Portrait Image Quality Assessment

Wei Sun, Weixia Zhang, Yanwei Jiang, HaoNing Wu, ZiCheng Zhang, Jun Jia, Yingjie Zhou, Zhongpeng Ji, Xiongkuo Min, Weisi Lin, Guangtao Zhai

2024-05-14Scene ClassificationLearning-To-RankVideo Quality AssessmentImage Quality AssessmentFace Image Quality Assessment
PaperPDFCode(official)

Abstract

Portrait images typically consist of a salient person against diverse backgrounds. With the development of mobile devices and image processing techniques, users can conveniently capture portrait images anytime and anywhere. However, the quality of these portraits may suffer from the degradation caused by unfavorable environmental conditions, subpar photography techniques, and inferior capturing devices. In this paper, we introduce a dual-branch network for portrait image quality assessment (PIQA), which can effectively address how the salient person and the background of a portrait image influence its visual quality. Specifically, we utilize two backbone networks (\textit{i.e.,} Swin Transformer-B) to extract the quality-aware features from the entire portrait image and the facial image cropped from it. To enhance the quality-aware feature representation of the backbones, we pre-train them on the large-scale video quality assessment dataset LSVQ and the large-scale facial image quality assessment dataset GFIQA. Additionally, we leverage LIQE, an image scene classification and quality assessment model, to capture the quality-aware and scene-specific features as the auxiliary features. Finally, we concatenate these features and regress them into quality scores via a multi-perception layer (MLP). We employ the fidelity loss to train the model via a learning-to-rank manner to mitigate inconsistencies in quality scores in the portrait image quality assessment dataset PIQ. Experimental results demonstrate that the proposed model achieves superior performance in the PIQ dataset, validating its effectiveness. The code is available at \url{https://github.com/sunwei925/DN-PIQA.git}.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingPIQ23KRCC0.68Dual-Branch Network
Facial Recognition and ModellingPIQ23MAE0.53Dual-Branch Network
Facial Recognition and ModellingPIQ23PLCC0.86Dual-Branch Network
Facial Recognition and ModellingPIQ23SRCC0.85Dual-Branch Network
Face ReconstructionPIQ23KRCC0.68Dual-Branch Network
Face ReconstructionPIQ23MAE0.53Dual-Branch Network
Face ReconstructionPIQ23PLCC0.86Dual-Branch Network
Face ReconstructionPIQ23SRCC0.85Dual-Branch Network
Face RecognitionPIQ23KRCC0.68Dual-Branch Network
Face RecognitionPIQ23MAE0.53Dual-Branch Network
Face RecognitionPIQ23PLCC0.86Dual-Branch Network
Face RecognitionPIQ23SRCC0.85Dual-Branch Network
3DPIQ23KRCC0.68Dual-Branch Network
3DPIQ23MAE0.53Dual-Branch Network
3DPIQ23PLCC0.86Dual-Branch Network
3DPIQ23SRCC0.85Dual-Branch Network
3D Face ModellingPIQ23KRCC0.68Dual-Branch Network
3D Face ModellingPIQ23MAE0.53Dual-Branch Network
3D Face ModellingPIQ23PLCC0.86Dual-Branch Network
3D Face ModellingPIQ23SRCC0.85Dual-Branch Network
3D Face ReconstructionPIQ23KRCC0.68Dual-Branch Network
3D Face ReconstructionPIQ23MAE0.53Dual-Branch Network
3D Face ReconstructionPIQ23PLCC0.86Dual-Branch Network
3D Face ReconstructionPIQ23SRCC0.85Dual-Branch Network

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment2025-07-17Text-Visual Semantic Constrained AI-Generated Image Quality Assessment2025-07-144KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09Kamae: Bridging Spark and Keras for Seamless ML Preprocessing2025-07-08Bridging Video Quality Scoring and Justification via Large Multimodal Models2025-06-26Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition2025-06-25Unidentified and Confounded? Understanding Two-Tower Models for Unbiased Learning to Rank2025-06-25