Local Learning with Deep and Handcrafted Features for Facial Expression Recognition

Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Marius Popescu

2018-04-29Facial Expression Recognition Facial Expression Recognition (FER)General Classification

Abstract

We present an approach that combines automatic features learned by convolutional neural networks (CNN) and handcrafted features computed by the bag-of-visual-words (BOVW) model in order to achieve state-of-the-art results in facial expression recognition. To obtain automatic features, we experiment with multiple CNN architectures, pre-trained models and training procedures, e.g. Dense-Sparse-Dense. After fusing the two types of features, we employ a local learning framework to predict the class label for each test image. The local learning framework is based on three steps. First, a k-nearest neighbors model is applied in order to select the nearest training samples for an input test image. Second, a one-versus-all Support Vector Machines (SVM) classifier is trained on the selected training samples. Finally, the SVM classifier is used to predict the class label only for the test image it was trained for. Although we have used local learning in combination with handcrafted features in our previous work, to the best of our knowledge, local learning has never been employed in combination with deep features. The experiments on the 2013 Facial Expression Recognition (FER) Challenge data set, the FER+ data set and the AffectNet data set demonstrate that our approach achieves state-of-the-art results. With a top accuracy of 75.42% on FER 2013, 87.76% on the FER+, 59.58% on AffectNet 8-way classification and 63.31% on AffectNet 7-way classification, we surpass the state-of-the-art methods by more than 1% on all data sets.

Results

Task	Dataset	Metric	Value	Model
Facial Recognition and Modelling	FER+	Accuracy	87.76	Local Learning Deep + BOW
Facial Recognition and Modelling	FER2013	Accuracy	75.42	Local Learning Deep+BOW
Facial Recognition and Modelling	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
Facial Recognition and Modelling	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
Facial Recognition and Modelling	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
Face Reconstruction	FER+	Accuracy	87.76	Local Learning Deep + BOW
Face Reconstruction	FER2013	Accuracy	75.42	Local Learning Deep+BOW
Face Reconstruction	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
Face Reconstruction	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
Face Reconstruction	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
Facial Expression Recognition (FER)	FER+	Accuracy	87.76	Local Learning Deep + BOW
Facial Expression Recognition (FER)	FER2013	Accuracy	75.42	Local Learning Deep+BOW
Facial Expression Recognition (FER)	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
Facial Expression Recognition (FER)	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
Facial Expression Recognition (FER)	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D	FER+	Accuracy	87.76	Local Learning Deep + BOW
3D	FER2013	Accuracy	75.42	Local Learning Deep+BOW
3D	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
3D	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
3D Face Modelling	FER+	Accuracy	87.76	Local Learning Deep + BOW
3D Face Modelling	FER2013	Accuracy	75.42	Local Learning Deep+BOW
3D Face Modelling	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
3D Face Modelling	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
3D Face Modelling	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D Face Reconstruction	FER+	Accuracy	87.76	Local Learning Deep + BOW
3D Face Reconstruction	FER2013	Accuracy	75.42	Local Learning Deep+BOW
3D Face Reconstruction	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
3D Face Reconstruction	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D Face Reconstruction	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW

Abstract

Results

Task	Dataset	Metric	Value	Model
Facial Recognition and Modelling	FER+	Accuracy	87.76	Local Learning Deep + BOW
Facial Recognition and Modelling	FER2013	Accuracy	75.42	Local Learning Deep+BOW
Facial Recognition and Modelling	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
Facial Recognition and Modelling	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
Facial Recognition and Modelling	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
Face Reconstruction	FER+	Accuracy	87.76	Local Learning Deep + BOW
Face Reconstruction	FER2013	Accuracy	75.42	Local Learning Deep+BOW
Face Reconstruction	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
Face Reconstruction	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
Face Reconstruction	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
Facial Expression Recognition (FER)	FER+	Accuracy	87.76	Local Learning Deep + BOW
Facial Expression Recognition (FER)	FER2013	Accuracy	75.42	Local Learning Deep+BOW
Facial Expression Recognition (FER)	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
Facial Expression Recognition (FER)	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
Facial Expression Recognition (FER)	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D	FER+	Accuracy	87.76	Local Learning Deep + BOW
3D	FER2013	Accuracy	75.42	Local Learning Deep+BOW
3D	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
3D	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
3D Face Modelling	FER+	Accuracy	87.76	Local Learning Deep + BOW
3D Face Modelling	FER2013	Accuracy	75.42	Local Learning Deep+BOW
3D Face Modelling	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW
3D Face Modelling	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
3D Face Modelling	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D Face Reconstruction	FER+	Accuracy	87.76	Local Learning Deep + BOW
3D Face Reconstruction	FER2013	Accuracy	75.42	Local Learning Deep+BOW
3D Face Reconstruction	AffectNet	Accuracy (7 emotion)	63.31	CNNs and BOVW + local SVM
3D Face Reconstruction	AffectNet	Accuracy (8 emotion)	59.58	CNNs and BOVW + local SVM
3D Face Reconstruction	FERPlus	Accuracy(pretrained)	87.76	Local Learning Deep + BOW

Local Learning with Deep and Handcrafted Features for Facial Expression Recognition

Abstract

Results

Related Papers

Local Learning with Deep and Handcrafted Features for Facial Expression Recognition

Abstract

Results

Related Papers