Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Zhilu Zhang, Mert R. Sabuncu

2018-05-20NeurIPS 2018 12Image Classification Learning with noisy labels

Abstract

Deep neural networks (DNNs) have achieved tremendous success in a variety of applications across many disciplines. Yet, their superior performance comes with the expensive cost of requiring correctly annotated large-scale datasets. Moreover, due to DNNs' rich capacity, errors in training labels can hamper performance. To combat this problem, mean absolute error (MAE) has recently been proposed as a noise-robust alternative to the commonly-used categorical cross entropy (CCE) loss. However, as we show in this paper, MAE can perform poorly with DNNs and challenging datasets. Here, we present a theoretically grounded set of noise-robust loss functions that can be seen as a generalization of MAE and CCE. Proposed loss functions can be readily applied with any existing DNN architecture and algorithm, while yielding good performance in a wide range of noisy label scenarios. We report results from experiments conducted with CIFAR-10, CIFAR-100 and FASHION-MNIST datasets and synthetically generated noisy labels.

Results

Task	Dataset	Metric	Value	Model
Image Classification	CIFAR-10N-Random2	Accuracy (mean)	87.7	GCE
Image Classification	CIFAR-10N-Random3	Accuracy (mean)	87.58	GCE
Image Classification	CIFAR-10N-Aggregate	Accuracy (mean)	87.85	GCE
Image Classification	CIFAR-10N-Random1	Accuracy (mean)	87.61	GCE
Image Classification	CIFAR-100N	Accuracy (mean)	56.73	GCE
Image Classification	CIFAR-10N-Worst	Accuracy (mean)	80.66	GCE
Document Text Classification	CIFAR-10N-Random2	Accuracy (mean)	87.7	GCE
Document Text Classification	CIFAR-10N-Random3	Accuracy (mean)	87.58	GCE
Document Text Classification	CIFAR-10N-Aggregate	Accuracy (mean)	87.85	GCE
Document Text Classification	CIFAR-10N-Random1	Accuracy (mean)	87.61	GCE
Document Text Classification	CIFAR-100N	Accuracy (mean)	56.73	GCE
Document Text Classification	CIFAR-10N-Worst	Accuracy (mean)	80.66	GCE

Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Abstract

Results

Related Papers

Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Abstract

Results

Related Papers