Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation

Maksim Kuprashevich, Grigorii Alekseenko, Irina Tolstykh

2024-03-04Facial Attribute Classification Age Estimation Age and Gender Estimation Age And Gender Classification Gender Prediction General Knowledge

Paper PDF Code(official)

Abstract

Multimodal Large Language Models (MLLMs) have recently gained immense popularity. Powerful commercial models like ChatGPT-4V and Gemini, as well as open-source ones such as LLaVA, are essentially general-purpose models and are applied to solve a wide variety of tasks, including those in computer vision. These neural networks possess such strong general knowledge and reasoning abilities that they have proven capable of working even on tasks for which they were not specifically trained. We compared the capabilities of the most powerful MLLMs to date: ShareGPT4V, ChatGPT, LLaVA-Next in a specialized task of age and gender estimation with our state-of-the-art specialized model, MiVOLO. We also updated MiVOLO and provide details and new metrics in this article. This comparison has yielded some interesting results and insights about the strengths and weaknesses of the participating models. Furthermore, we attempted various ways to fine-tune the ShareGPT4V model for this specific task, aiming to achieve state-of-the-art results in this particular challenge. Although such a model would not be practical in production, as it is incredibly expensive compared to a specialized model like MiVOLO, it could be very useful in some tasks, like data annotation.

Results

Task	Dataset	Metric	Value	Model
Facial Recognition and Modelling	LAGENDA	MAE	3.65	MiVOLO-V2
Facial Recognition and Modelling	IMDB-Clean	Average mean absolute error	3.97	MiVOLO-V2
Facial Recognition and Modelling	CACD	MAE	3.89	MiVOLO-V2
Facial Recognition and Modelling	LAGENDA	Accuracy	97.99	MiVOLO-V2
Facial Recognition and Modelling	FairFace	age-top1	62.28	MiVOLO-V2
Facial Recognition and Modelling	FairFace	gender-top1	97.5	MiVOLO-V2
Facial Recognition and Modelling	Adience Gender	Accuracy (5-fold)	97.39	MiVOLO-V2
Facial Recognition and Modelling	Adience Age	Accuracy (5-fold)	69.43	MiVOLO-V2
Face Reconstruction	LAGENDA	MAE	3.65	MiVOLO-V2
Face Reconstruction	IMDB-Clean	Average mean absolute error	3.97	MiVOLO-V2
Face Reconstruction	CACD	MAE	3.89	MiVOLO-V2
Face Reconstruction	LAGENDA	Accuracy	97.99	MiVOLO-V2
Face Reconstruction	FairFace	age-top1	62.28	MiVOLO-V2
Face Reconstruction	FairFace	gender-top1	97.5	MiVOLO-V2
Face Reconstruction	Adience Gender	Accuracy (5-fold)	97.39	MiVOLO-V2
Face Reconstruction	Adience Age	Accuracy (5-fold)	69.43	MiVOLO-V2
3D	LAGENDA	MAE	3.65	MiVOLO-V2
3D	IMDB-Clean	Average mean absolute error	3.97	MiVOLO-V2
3D	CACD	MAE	3.89	MiVOLO-V2
3D	LAGENDA	Accuracy	97.99	MiVOLO-V2
3D	FairFace	age-top1	62.28	MiVOLO-V2
3D	FairFace	gender-top1	97.5	MiVOLO-V2
3D	Adience Gender	Accuracy (5-fold)	97.39	MiVOLO-V2
3D	Adience Age	Accuracy (5-fold)	69.43	MiVOLO-V2
3D Face Modelling	LAGENDA	MAE	3.65	MiVOLO-V2
3D Face Modelling	IMDB-Clean	Average mean absolute error	3.97	MiVOLO-V2
3D Face Modelling	CACD	MAE	3.89	MiVOLO-V2
3D Face Modelling	LAGENDA	Accuracy	97.99	MiVOLO-V2
3D Face Modelling	FairFace	age-top1	62.28	MiVOLO-V2
3D Face Modelling	FairFace	gender-top1	97.5	MiVOLO-V2
3D Face Modelling	Adience Gender	Accuracy (5-fold)	97.39	MiVOLO-V2
3D Face Modelling	Adience Age	Accuracy (5-fold)	69.43	MiVOLO-V2
3D Face Reconstruction	LAGENDA	MAE	3.65	MiVOLO-V2
3D Face Reconstruction	IMDB-Clean	Average mean absolute error	3.97	MiVOLO-V2
3D Face Reconstruction	CACD	MAE	3.89	MiVOLO-V2
3D Face Reconstruction	LAGENDA	Accuracy	97.99	MiVOLO-V2
3D Face Reconstruction	FairFace	age-top1	62.28	MiVOLO-V2
3D Face Reconstruction	FairFace	gender-top1	97.5	MiVOLO-V2
3D Face Reconstruction	Adience Gender	Accuracy (5-fold)	97.39	MiVOLO-V2
3D Face Reconstruction	Adience Age	Accuracy (5-fold)	69.43	MiVOLO-V2
Age and Gender Estimation	LAGENDA gender	CS@5	74.48	MiVOLO-V2
Age and Gender Estimation	LAGENDA age	CS@5	74.48	MiVOLO-V2
Age and Gender Estimation	LAGENDA age	MAE	3.65	MiVOLO-V2
Age Estimation	LAGENDA	MAE	3.65	MiVOLO-V2
Age Estimation	IMDB-Clean	Average mean absolute error	3.97	MiVOLO-V2
Age Estimation	CACD	MAE	3.89	MiVOLO-V2
Age And Gender Classification	Adience Gender	Accuracy (5-fold)	97.39	MiVOLO-V2
Age And Gender Classification	Adience Age	Accuracy (5-fold)	69.43	MiVOLO-V2

Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation

Abstract

Results

Related Papers

Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation

Abstract

Results

Related Papers