TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MiVOLO: Multi-input Transformer for Age and Gender Estimat...

MiVOLO: Multi-input Transformer for Age and Gender Estimation

Maksim Kuprashevich, Irina Tolstykh

2023-07-10Facial Attribute ClassificationAge EstimationAge and Gender EstimationAge And Gender ClassificationGender Prediction
PaperPDFCodeCode(official)

Abstract

Age and gender recognition in the wild is a highly challenging task: apart from the variability of conditions, pose complexities, and varying image quality, there are cases where the face is partially or completely occluded. We present MiVOLO (Multi Input VOLO), a straightforward approach for age and gender estimation using the latest vision transformer. Our method integrates both tasks into a unified dual input/output model, leveraging not only facial information but also person image data. This improves the generalization ability of our model and enables it to deliver satisfactory results even when the face is not visible in the image. To evaluate our proposed model, we conduct experiments on four popular benchmarks and achieve state-of-the-art performance, while demonstrating real-time processing capabilities. Additionally, we introduce a novel benchmark based on images from the Open Images Dataset. The ground truth annotations for this benchmark have been meticulously generated by human annotators, resulting in high accuracy answers due to the smart aggregation of votes. Furthermore, we compare our model's age recognition performance with human-level accuracy and demonstrate that it significantly outperforms humans across a majority of age ranges. Finally, we grant public access to our models, along with the code for validation and inference. In addition, we provide extra annotations for used datasets and introduce our new benchmark.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingLAGENDAMAE3.99MiVOLO-D1
Facial Recognition and ModellingAgeDBMAE5.55MiVOLO-D1
Facial Recognition and ModellingIMDB-CleanAverage mean absolute error4.09MiVOLO-D1
Facial Recognition and ModellingIMDB-CleanAverage mean absolute error4.22VOLO-D1 age&gender
Facial Recognition and ModellingUTKFaceMAE3.7MiVOLO-D1
Facial Recognition and ModellingUTKFaceMAE4.23VOLO-D1 age&gender
Facial Recognition and ModellingAgeDBAccuracy98.3MiVOLO-D1
Facial Recognition and ModellingLAGENDAAccuracy97.36MiVOLO-D1
Facial Recognition and ModellingFairFaceage-top161.07MiVOLO-D1
Facial Recognition and ModellingFairFacegender-top195.73MiVOLO-D1
Facial Recognition and ModellingAdience GenderAccuracy (5-fold)96.51MiVOLO-D1
Facial Recognition and ModellingAdience AgeAccuracy (5-fold)68.69MiVOLO-D1
Face ReconstructionLAGENDAMAE3.99MiVOLO-D1
Face ReconstructionAgeDBMAE5.55MiVOLO-D1
Face ReconstructionIMDB-CleanAverage mean absolute error4.09MiVOLO-D1
Face ReconstructionIMDB-CleanAverage mean absolute error4.22VOLO-D1 age&gender
Face ReconstructionUTKFaceMAE3.7MiVOLO-D1
Face ReconstructionUTKFaceMAE4.23VOLO-D1 age&gender
Face ReconstructionAgeDBAccuracy98.3MiVOLO-D1
Face ReconstructionLAGENDAAccuracy97.36MiVOLO-D1
Face ReconstructionFairFaceage-top161.07MiVOLO-D1
Face ReconstructionFairFacegender-top195.73MiVOLO-D1
Face ReconstructionAdience GenderAccuracy (5-fold)96.51MiVOLO-D1
Face ReconstructionAdience AgeAccuracy (5-fold)68.69MiVOLO-D1
3DLAGENDAMAE3.99MiVOLO-D1
3DAgeDBMAE5.55MiVOLO-D1
3DIMDB-CleanAverage mean absolute error4.09MiVOLO-D1
3DIMDB-CleanAverage mean absolute error4.22VOLO-D1 age&gender
3DUTKFaceMAE3.7MiVOLO-D1
3DUTKFaceMAE4.23VOLO-D1 age&gender
3DAgeDBAccuracy98.3MiVOLO-D1
3DLAGENDAAccuracy97.36MiVOLO-D1
3DFairFaceage-top161.07MiVOLO-D1
3DFairFacegender-top195.73MiVOLO-D1
3DAdience GenderAccuracy (5-fold)96.51MiVOLO-D1
3DAdience AgeAccuracy (5-fold)68.69MiVOLO-D1
3D Face ModellingLAGENDAMAE3.99MiVOLO-D1
3D Face ModellingAgeDBMAE5.55MiVOLO-D1
3D Face ModellingIMDB-CleanAverage mean absolute error4.09MiVOLO-D1
3D Face ModellingIMDB-CleanAverage mean absolute error4.22VOLO-D1 age&gender
3D Face ModellingUTKFaceMAE3.7MiVOLO-D1
3D Face ModellingUTKFaceMAE4.23VOLO-D1 age&gender
3D Face ModellingAgeDBAccuracy98.3MiVOLO-D1
3D Face ModellingLAGENDAAccuracy97.36MiVOLO-D1
3D Face ModellingFairFaceage-top161.07MiVOLO-D1
3D Face ModellingFairFacegender-top195.73MiVOLO-D1
3D Face ModellingAdience GenderAccuracy (5-fold)96.51MiVOLO-D1
3D Face ModellingAdience AgeAccuracy (5-fold)68.69MiVOLO-D1
3D Face ReconstructionLAGENDAMAE3.99MiVOLO-D1
3D Face ReconstructionAgeDBMAE5.55MiVOLO-D1
3D Face ReconstructionIMDB-CleanAverage mean absolute error4.09MiVOLO-D1
3D Face ReconstructionIMDB-CleanAverage mean absolute error4.22VOLO-D1 age&gender
3D Face ReconstructionUTKFaceMAE3.7MiVOLO-D1
3D Face ReconstructionUTKFaceMAE4.23VOLO-D1 age&gender
3D Face ReconstructionAgeDBAccuracy98.3MiVOLO-D1
3D Face ReconstructionLAGENDAAccuracy97.36MiVOLO-D1
3D Face ReconstructionFairFaceage-top161.07MiVOLO-D1
3D Face ReconstructionFairFacegender-top195.73MiVOLO-D1
3D Face ReconstructionAdience GenderAccuracy (5-fold)96.51MiVOLO-D1
3D Face ReconstructionAdience AgeAccuracy (5-fold)68.69MiVOLO-D1
Age and Gender EstimationLAGENDA genderAccuracy97.36MiVOLO-D1
Age and Gender EstimationLAGENDA ageCS@571.27MiVOLO-D1
Age and Gender EstimationLAGENDA ageMAE3.99MiVOLO-D1
Age EstimationLAGENDAMAE3.99MiVOLO-D1
Age EstimationAgeDBMAE5.55MiVOLO-D1
Age EstimationIMDB-CleanAverage mean absolute error4.09MiVOLO-D1
Age EstimationIMDB-CleanAverage mean absolute error4.22VOLO-D1 age&gender
Age EstimationUTKFaceMAE3.7MiVOLO-D1
Age EstimationUTKFaceMAE4.23VOLO-D1 age&gender
Age And Gender ClassificationAdience GenderAccuracy (5-fold)96.51MiVOLO-D1
Age And Gender ClassificationAdience AgeAccuracy (5-fold)68.69MiVOLO-D1

Related Papers

DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation2025-07-17Data-Driven Meta-Analysis and Public-Dataset Evaluation for Sensor-Based Gait Age Estimation2025-07-15Fusing Radiomic Features with Deep Representations for Gestational Age Estimation in Fetal Ultrasound Images2025-06-25AGE-US: automated gestational age estimation based on fetal ultrasound images2025-06-19Foundation Artificial Intelligence Models for Health Recognition Using Face Photographs (FAHR-Face)2025-06-17Overcoming Occlusions in the Wild: A Multi-Task Age Head Approach to Age Estimation2025-06-16Underage Detection through a Multi-Task and MultiAge Approach for Screening Minors in Unconstrained Imagery2025-06-12A Manually Annotated Image-Caption Dataset for Detecting Children in the Wild2025-06-11