Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


General Facial Representation Learning in a Visual-Linguistic Manner

Yinglin Zheng, Hao Yang, Ting Zhang, Jianmin Bao, Dongdong Chen, Yangyu Huang, Lu Yuan, Dong Chen, Ming Zeng, Fang Wen

2021-12-06 | CVPR 2022 | Face Alignment, Face Parsing, Representation Learning

Abstract

How to learn a universal facial representation that boosts all face analysis tasks? This paper takes one step toward this goal. In this paper, we study the transfer performance of pre-trained models on face analysis tasks and introduce a framework, called FaRL, for general Facial Representation Learning in a visual-linguistic manner. On one hand, the framework involves a contrastive loss to learn high-level semantic meaning from image-text pairs. On the other hand, we propose exploring low-level information simultaneously to further enhance the face representation, by adding masked image modeling. We perform pre-training on LAION-FACE, a dataset containing a large number of face image-text pairs, and evaluate the representation capability on multiple downstream tasks. We show that FaRL achieves better transfer performance compared with previous pre-trained models. We also verify its superiority in the low-data regime. More importantly, our model surpasses the state-of-the-art methods on face analysis tasks including face parsing and face alignment.
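
The contrastive objective mentioned in the abstract can be illustrated with a small sketch. What follows is a generic symmetric image-text contrastive (InfoNCE-style) loss written in plain Python; it is not taken from the FaRL code. The function names, the temperature value of 0.07, and the toy embeddings are assumptions made for illustration, and the masked image modeling term is not covered here.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def log_softmax(row):
    """Numerically stable log-softmax over a list of logits."""
    m = max(row)
    lse = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - lse for x in row]


def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric contrastive loss over a batch of paired embeddings.

    Pair k (image k, text k) is the positive; every other combination in the
    batch acts as a negative. The temperature is an assumed value.
    """
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    n = len(imgs)
    # logits[i][j] = cosine similarity of image i and text j, scaled by temperature
    logits = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in txts]
        for img in imgs
    ]
    # Image-to-text direction: each row should put its mass on the diagonal.
    loss_i2t = -sum(log_softmax(logits[k])[k] for k in range(n)) / n
    # Text-to-image direction: same computation on the transposed matrix.
    columns = [[logits[r][c] for r in range(n)] for c in range(n)]
    loss_t2i = -sum(log_softmax(columns[k])[k] for k in range(n)) / n
    return 0.5 * (loss_i2t + loss_t2i)


# Toy batch (hypothetical embeddings): matched pairs versus swapped captions.
images = [[1.0, 0.0], [0.0, 1.0]]
matched_texts = [[1.0, 0.0], [0.0, 1.0]]
swapped_texts = [[0.0, 1.0], [1.0, 0.0]]
loss_matched = contrastive_loss(images, matched_texts)
loss_swapped = contrastive_loss(images, swapped_texts)
```

In this toy batch the correctly paired captions give a much lower loss than the swapped ones, which is the behaviour the objective rewards during pre-training.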

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | WFW (Extra Data) | AUC@10 (inter-ocular) | 61.16 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | WFW (Extra Data) | FR@10 (inter-ocular) | 1.76 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | WFW (Extra Data) | NME (inter-ocular) | 3.96 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | AFLW-19 | AUC_box@0.07 (%, Full) | 81.3 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | AFLW-19 | NME_box (%, Full) | 1.334 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | AFLW-19 | NME_diag (%, Frontal) | 0.821 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | AFLW-19 | NME_diag (%, Full) | 0.943 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.42 | FaRL-B (epoch 64) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Common) | 2.5 | FaRL-B (epoch 64) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Full) | 2.88 | FaRL-B (epoch 64) |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Challenge) | 6.38 | FaRL-B (epoch 64) |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Common) | 3.46 | FaRL-B (epoch 64) |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Full) | 4.05 | FaRL-B (epoch 64) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.45 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Common) | 2.56 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | 300W | NME_inter-ocular (%, Full) | 2.93 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Challenge) | 6.42 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Common) | 3.53 | FaRL-B (epoch 16) |
| Facial Recognition and Modelling | 300W | NME_inter-pupil (%, Full) | 4.11 | FaRL-B (epoch 16) |
| Scene Parsing | CelebAMask-HQ | Mean F1 | 89.56 | FaRL-B |
| Scene Parsing | LaPa | Mean F1 | 93.88 | FaRL-B |
| Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.42 | FaRL-B (epoch 64) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 2.5 | FaRL-B (epoch 64) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Full) | 2.88 | FaRL-B (epoch 64) |
| Face Reconstruction | 300W | NME_inter-pupil (%, Challenge) | 6.38 | FaRL-B (epoch 64) |
| Face Reconstruction | 300W | NME_inter-pupil (%, Common) | 3.46 | FaRL-B (epoch 64) |
| Face Reconstruction | 300W | NME_inter-pupil (%, Full) | 4.05 | FaRL-B (epoch 64) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.45 | FaRL-B (epoch 16) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 2.56 | FaRL-B (epoch 16) |
| Face Reconstruction | 300W | NME_inter-ocular (%, Full) | 2.93 | FaRL-B (epoch 16) |
| Face Reconstruction | 300W | NME_inter-pupil (%, Challenge) | 6.42 | FaRL-B (epoch 16) |
| Face Reconstruction | 300W | NME_inter-pupil (%, Common) | 3.53 | FaRL-B (epoch 16) |
| Face Reconstruction | 300W | NME_inter-pupil (%, Full) | 4.11 | FaRL-B (epoch 16) |
| Face Reconstruction | WFW (Extra Data) | AUC@10 (inter-ocular) | 61.16 | FaRL-B (epoch 16) |
| Face Reconstruction | WFW (Extra Data) | FR@10 (inter-ocular) | 1.76 | FaRL-B (epoch 16) |
| Face Reconstruction | WFW (Extra Data) | NME (inter-ocular) | 3.96 | FaRL-B (epoch 16) |
| Face Reconstruction | AFLW-19 | AUC_box@0.07 (%, Full) | 81.3 | FaRL-B (epoch 16) |
| Face Reconstruction | AFLW-19 | NME_box (%, Full) | 1.334 | FaRL-B (epoch 16) |
| Face Reconstruction | AFLW-19 | NME_diag (%, Frontal) | 0.821 | FaRL-B (epoch 16) |
| Face Reconstruction | AFLW-19 | NME_diag (%, Full) | 0.943 | FaRL-B (epoch 16) |
| 3D | 300W | NME_inter-ocular (%, Challenge) | 4.42 | FaRL-B (epoch 64) |
| 3D | 300W | NME_inter-ocular (%, Common) | 2.5 | FaRL-B (epoch 64) |
| 3D | 300W | NME_inter-ocular (%, Full) | 2.88 | FaRL-B (epoch 64) |
| 3D | 300W | NME_inter-pupil (%, Challenge) | 6.38 | FaRL-B (epoch 64) |
| 3D | 300W | NME_inter-pupil (%, Common) | 3.46 | FaRL-B (epoch 64) |
| 3D | 300W | NME_inter-pupil (%, Full) | 4.05 | FaRL-B (epoch 64) |
| 3D | 300W | NME_inter-ocular (%, Challenge) | 4.45 | FaRL-B (epoch 16) |
| 3D | 300W | NME_inter-ocular (%, Common) | 2.56 | FaRL-B (epoch 16) |
| 3D | 300W | NME_inter-ocular (%, Full) | 2.93 | FaRL-B (epoch 16) |
| 3D | 300W | NME_inter-pupil (%, Challenge) | 6.42 | FaRL-B (epoch 16) |
| 3D | 300W | NME_inter-pupil (%, Common) | 3.53 | FaRL-B (epoch 16) |
| 3D | 300W | NME_inter-pupil (%, Full) | 4.11 | FaRL-B (epoch 16) |
| 3D | WFW (Extra Data) | AUC@10 (inter-ocular) | 61.16 | FaRL-B (epoch 16) |
| 3D | WFW (Extra Data) | FR@10 (inter-ocular) | 1.76 | FaRL-B (epoch 16) |
| 3D | WFW (Extra Data) | NME (inter-ocular) | 3.96 | FaRL-B (epoch 16) |
| 3D | AFLW-19 | AUC_box@0.07 (%, Full) | 81.3 | FaRL-B (epoch 16) |
| 3D | AFLW-19 | NME_box (%, Full) | 1.334 | FaRL-B (epoch 16) |
| 3D | AFLW-19 | NME_diag (%, Frontal) | 0.821 | FaRL-B (epoch 16) |
| 3D | AFLW-19 | NME_diag (%, Full) | 0.943 | FaRL-B (epoch 16) |
| 3D Face Modelling | WFW (Extra Data) | AUC@10 (inter-ocular) | 61.16 | FaRL-B (epoch 16) |
| 3D Face Modelling | WFW (Extra Data) | FR@10 (inter-ocular) | 1.76 | FaRL-B (epoch 16) |
| 3D Face Modelling | WFW (Extra Data) | NME (inter-ocular) | 3.96 | FaRL-B (epoch 16) |
| 3D Face Modelling | AFLW-19 | AUC_box@0.07 (%, Full) | 81.3 | FaRL-B (epoch 16) |
| 3D Face Modelling | AFLW-19 | NME_box (%, Full) | 1.334 | FaRL-B (epoch 16) |
| 3D Face Modelling | AFLW-19 | NME_diag (%, Frontal) | 0.821 | FaRL-B (epoch 16) |
| 3D Face Modelling | AFLW-19 | NME_diag (%, Full) | 0.943 | FaRL-B (epoch 16) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.42 | FaRL-B (epoch 64) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Common) | 2.5 | FaRL-B (epoch 64) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Full) | 2.88 | FaRL-B (epoch 64) |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Challenge) | 6.38 | FaRL-B (epoch 64) |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Common) | 3.46 | FaRL-B (epoch 64) |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Full) | 4.05 | FaRL-B (epoch 64) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Challenge) | 4.45 | FaRL-B (epoch 16) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Common) | 2.56 | FaRL-B (epoch 16) |
| 3D Face Modelling | 300W | NME_inter-ocular (%, Full) | 2.93 | FaRL-B (epoch 16) |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Challenge) | 6.42 | FaRL-B (epoch 16) |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Common) | 3.53 | FaRL-B (epoch 16) |
| 3D Face Modelling | 300W | NME_inter-pupil (%, Full) | 4.11 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | WFW (Extra Data) | AUC@10 (inter-ocular) | 61.16 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | WFW (Extra Data) | FR@10 (inter-ocular) | 1.76 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | WFW (Extra Data) | NME (inter-ocular) | 3.96 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | AFLW-19 | AUC_box@0.07 (%, Full) | 81.3 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | AFLW-19 | NME_box (%, Full) | 1.334 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | AFLW-19 | NME_diag (%, Frontal) | 0.821 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | AFLW-19 | NME_diag (%, Full) | 0.943 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.42 | FaRL-B (epoch 64) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 2.5 | FaRL-B (epoch 64) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Full) | 2.88 | FaRL-B (epoch 64) |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Challenge) | 6.38 | FaRL-B (epoch 64) |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Common) | 3.46 | FaRL-B (epoch 64) |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Full) | 4.05 | FaRL-B (epoch 64) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Challenge) | 4.45 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Common) | 2.56 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | 300W | NME_inter-ocular (%, Full) | 2.93 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Challenge) | 6.42 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Common) | 3.53 | FaRL-B (epoch 16) |
| 3D Face Reconstruction | 300W | NME_inter-pupil (%, Full) | 4.11 | FaRL-B (epoch 16) |
| 2D Semantic Segmentation | CelebAMask-HQ | Mean F1 | 89.56 | FaRL-B |
| 2D Semantic Segmentation | LaPa | Mean F1 | 93.88 | FaRL-B |
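
For readers unfamiliar with the alignment metrics in the results above, the sketch below shows one common way to compute a normalized mean error (NME) and a failure rate (FR) at a threshold. It is an illustration under assumptions, not the evaluation code behind these numbers: the function names and landmark indices are hypothetical, the normalizer is taken as the distance between two ground-truth eye landmarks, and each benchmark may define its normalization (inter-ocular, inter-pupil, box, diagonal) differently.

```python
import math


def nme(pred, gt, norm_dist):
    """Mean landmark error divided by a normalizing distance, as a percentage."""
    errors = [math.dist(p, g) for p, g in zip(pred, gt)]
    return 100.0 * sum(errors) / (len(errors) * norm_dist)


def inter_ocular_nme(pred, gt, left_outer, right_outer):
    """NME normalized by the distance between two ground-truth eye landmarks.

    left_outer and right_outer are landmark indices chosen by the caller;
    which indices mark the outer eye corners depends on the annotation scheme.
    """
    norm_dist = math.dist(gt[left_outer], gt[right_outer])
    return nme(pred, gt, norm_dist)


def failure_rate(nme_values, threshold=10.0):
    """Percentage of images whose NME exceeds the threshold (FR@10 when threshold=10)."""
    failures = sum(1 for value in nme_values if value > threshold)
    return 100.0 * failures / len(nme_values)


# Toy example with three hypothetical 2D landmarks; indices 0 and 1 play the eye corners.
gt_points = [(0.0, 0.0), (10.0, 0.0), (5.0, 5.0)]
pred_points = [(0.0, 1.0), (10.0, 0.0), (5.0, 5.0)]
example_nme = inter_ocular_nme(pred_points, gt_points, 0, 1)
example_fr = failure_rate([3.0, 12.0, 9.9, 10.5])
```

Lower is better for both NME and FR, which is why the lower epoch-64 NME values on 300W count as the stronger results.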

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper (2025-07-20)
Spectral Bellman Method: Unifying Representation and Exploration in RL (2025-07-17)
Boosting Team Modeling through Tempo-Relational Representation Learning (2025-07-17)
Similarity-Guided Diffusion for Contrastive Sequential Recommendation (2025-07-16)
Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization? (2025-07-16)
Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos (2025-07-16)
A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction (2025-07-15)
Dual Dimensions Geometric Representation Learning Based Document Dewarping (2025-07-11)