TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Local Learning with Deep and Handcrafted Features for Faci...

Local Learning with Deep and Handcrafted Features for Facial Expression Recognition

Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Marius Popescu

2018-04-29Facial Expression RecognitionFacial Expression Recognition (FER)General Classification
PaperPDF

Abstract

We present an approach that combines automatic features learned by convolutional neural networks (CNN) and handcrafted features computed by the bag-of-visual-words (BOVW) model in order to achieve state-of-the-art results in facial expression recognition. To obtain automatic features, we experiment with multiple CNN architectures, pre-trained models and training procedures, e.g. Dense-Sparse-Dense. After fusing the two types of features, we employ a local learning framework to predict the class label for each test image. The local learning framework is based on three steps. First, a k-nearest neighbors model is applied in order to select the nearest training samples for an input test image. Second, a one-versus-all Support Vector Machines (SVM) classifier is trained on the selected training samples. Finally, the SVM classifier is used to predict the class label only for the test image it was trained for. Although we have used local learning in combination with handcrafted features in our previous work, to the best of our knowledge, local learning has never been employed in combination with deep features. The experiments on the 2013 Facial Expression Recognition (FER) Challenge data set, the FER+ data set and the AffectNet data set demonstrate that our approach achieves state-of-the-art results. With a top accuracy of 75.42% on FER 2013, 87.76% on the FER+, 59.58% on AffectNet 8-way classification and 63.31% on AffectNet 7-way classification, we surpass the state-of-the-art methods by more than 1% on all data sets.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingFER+Accuracy87.76Local Learning Deep + BOW
Facial Recognition and ModellingFER2013Accuracy75.42Local Learning Deep+BOW
Facial Recognition and ModellingAffectNetAccuracy (7 emotion)63.31CNNs and BOVW + local SVM
Facial Recognition and ModellingAffectNetAccuracy (8 emotion)59.58CNNs and BOVW + local SVM
Facial Recognition and ModellingFERPlusAccuracy(pretrained)87.76Local Learning Deep + BOW
Face ReconstructionFER+Accuracy87.76Local Learning Deep + BOW
Face ReconstructionFER2013Accuracy75.42Local Learning Deep+BOW
Face ReconstructionAffectNetAccuracy (7 emotion)63.31CNNs and BOVW + local SVM
Face ReconstructionAffectNetAccuracy (8 emotion)59.58CNNs and BOVW + local SVM
Face ReconstructionFERPlusAccuracy(pretrained)87.76Local Learning Deep + BOW
Facial Expression Recognition (FER)FER+Accuracy87.76Local Learning Deep + BOW
Facial Expression Recognition (FER)FER2013Accuracy75.42Local Learning Deep+BOW
Facial Expression Recognition (FER)FERPlusAccuracy(pretrained)87.76Local Learning Deep + BOW
Facial Expression Recognition (FER)AffectNetAccuracy (7 emotion)63.31CNNs and BOVW + local SVM
Facial Expression Recognition (FER)AffectNetAccuracy (8 emotion)59.58CNNs and BOVW + local SVM
3DFER+Accuracy87.76Local Learning Deep + BOW
3DFER2013Accuracy75.42Local Learning Deep+BOW
3DAffectNetAccuracy (7 emotion)63.31CNNs and BOVW + local SVM
3DAffectNetAccuracy (8 emotion)59.58CNNs and BOVW + local SVM
3DFERPlusAccuracy(pretrained)87.76Local Learning Deep + BOW
3D Face ModellingFER+Accuracy87.76Local Learning Deep + BOW
3D Face ModellingFER2013Accuracy75.42Local Learning Deep+BOW
3D Face ModellingFERPlusAccuracy(pretrained)87.76Local Learning Deep + BOW
3D Face ModellingAffectNetAccuracy (7 emotion)63.31CNNs and BOVW + local SVM
3D Face ModellingAffectNetAccuracy (8 emotion)59.58CNNs and BOVW + local SVM
3D Face ReconstructionFER+Accuracy87.76Local Learning Deep + BOW
3D Face ReconstructionFER2013Accuracy75.42Local Learning Deep+BOW
3D Face ReconstructionAffectNetAccuracy (7 emotion)63.31CNNs and BOVW + local SVM
3D Face ReconstructionAffectNetAccuracy (8 emotion)59.58CNNs and BOVW + local SVM
3D Face ReconstructionFERPlusAccuracy(pretrained)87.76Local Learning Deep + BOW

Related Papers

Multimodal Prompt Alignment for Facial Expression Recognition2025-06-26Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation2025-06-25Using Vision Language Models to Detect Students' Academic Emotion through Facial Expressions2025-06-12EfficientFER: EfficientNetv2 Based Deep Learning Approach for Facial Expression Recognition2025-06-02TKFNet: Learning Texture Key Factor Driven Feature for Facial Expression Recognition2025-05-15Unsupervised Multiview Contrastive Language-Image Joint Learning with Pseudo-Labeled Prompts Via Vision-Language Model for 3D/4D Facial Expression Recognition2025-05-14Achieving 3D Attention via Triplet Squeeze and Excitation Block2025-05-09Some Optimizers are More Equal: Understanding the Role of Optimizers in Group Fairness2025-04-21