Yuhang Zhang, Chengrui Wang, Xu Ling, Weihong Deng
Noisy label Facial Expression Recognition (FER) is more challenging than traditional noisy label classification tasks due to the inter-class similarity and the annotation ambiguity. Recent works mainly tackle this problem by filtering out large-loss samples. In this paper, we explore dealing with noisy labels from a new feature-learning perspective. We find that FER models remember noisy samples by focusing on a part of the features that can be considered related to the noisy labels instead of learning from the whole features that lead to the latent truth. Inspired by that, we propose a novel Erasing Attention Consistency (EAC) method to suppress the noisy samples during the training process automatically. Specifically, we first utilize the flip semantic consistency of facial images to design an imbalanced framework. We then randomly erase input images and use flip attention consistency to prevent the model from focusing on a part of the features. EAC significantly outperforms state-of-the-art noisy label FER methods and generalizes well to other tasks with a large number of classes like CIFAR100 and Tiny-ImageNet. The code is available at https://github.com/zyh-uaiaaaa/Erasing-Attention-Consistency.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Facial Recognition and Modelling | FER+ | Accuracy | 89.64 | EAC |
| Facial Recognition and Modelling | RAF-DB | Overall Accuracy | 90.35 | EAC(ResNet-50) |
| Facial Recognition and Modelling | AffectNet | Accuracy (7 emotion) | 65.32 | EAC |
| Face Reconstruction | FER+ | Accuracy | 89.64 | EAC |
| Face Reconstruction | RAF-DB | Overall Accuracy | 90.35 | EAC(ResNet-50) |
| Face Reconstruction | AffectNet | Accuracy (7 emotion) | 65.32 | EAC |
| Facial Expression Recognition (FER) | FER+ | Accuracy | 89.64 | EAC |
| Facial Expression Recognition (FER) | RAF-DB | Overall Accuracy | 90.35 | EAC(ResNet-50) |
| Facial Expression Recognition (FER) | AffectNet | Accuracy (7 emotion) | 65.32 | EAC |
| 3D | FER+ | Accuracy | 89.64 | EAC |
| 3D | RAF-DB | Overall Accuracy | 90.35 | EAC(ResNet-50) |
| 3D | AffectNet | Accuracy (7 emotion) | 65.32 | EAC |
| 3D Face Modelling | FER+ | Accuracy | 89.64 | EAC |
| 3D Face Modelling | RAF-DB | Overall Accuracy | 90.35 | EAC(ResNet-50) |
| 3D Face Modelling | AffectNet | Accuracy (7 emotion) | 65.32 | EAC |
| 3D Face Reconstruction | FER+ | Accuracy | 89.64 | EAC |
| 3D Face Reconstruction | RAF-DB | Overall Accuracy | 90.35 | EAC(ResNet-50) |
| 3D Face Reconstruction | AffectNet | Accuracy (7 emotion) | 65.32 | EAC |