SURE: SUrvey REcipes for building reliable and robust deep networks

Yuting Li, Yingyi Chen, Xuanlong Yu, Dexiong Chen, Xi Shen

2024-03-01CVPR 2024 1Image Classification Long-tail Learning Learning with noisy labels

Abstract

In this paper, we revisit techniques for uncertainty estimation within deep neural networks and consolidate a suite of techniques to enhance their reliability. Our investigation reveals that an integrated application of diverse techniques--spanning model regularization, classifier and optimization--substantially improves the accuracy of uncertainty predictions in image classification tasks. The synergistic effect of these techniques culminates in our novel SURE approach. We rigorously evaluate SURE against the benchmark of failure prediction, a critical testbed for uncertainty estimation efficacy. Our results showcase a consistently better performance than models that individually deploy each technique, across various datasets and model architectures. When applied to real-world challenges, such as data corruption, label noise, and long-tailed class distribution, SURE exhibits remarkable robustness, delivering results that are superior or on par with current state-of-the-art specialized methods. Particularly on Animal-10N and Food-101N for learning with noisy labels, SURE achieves state-of-the-art performance without any task-specific adjustments. This work not only sets a new benchmark for robust uncertainty estimation but also paves the way for its application in diverse, real-world scenarios where reliability is paramount. Our code is available at \url{https://yutingli0606.github.io/SURE/}.

Results

Task	Dataset	Metric	Value	Model
Image Classification	Food-101N	Accuracy	88	SURE(ResNet-50)
Image Classification	CIFAR-10-LT (ρ=10)	Error Rate	5.04	SURE(ResNet-32)
Image Classification	CIFAR-100-LT (ρ=50)	Error Rate	36.87	SURE(ResNet-32)
Image Classification	CIFAR-100-LT (ρ=10)	Error Rate	26.76	SURE(ResNet-32)
Image Classification	CIFAR-10-LT (ρ=50)	Error Rate	9.78	SURE(ResNet-32)
Image Classification	CIFAR-100-LT (ρ=100)	Error Rate	43.66	SURE(ResNet-32)
Image Classification	CIFAR-10-LT (ρ=100)	Error Rate	13.07	SURE(ResNet-32)
Image Classification	ANIMAL	Accuracy	89	SURE
Document Text Classification	ANIMAL	Accuracy	89	SURE
Few-Shot Image Classification	CIFAR-10-LT (ρ=10)	Error Rate	5.04	SURE(ResNet-32)
Few-Shot Image Classification	CIFAR-100-LT (ρ=50)	Error Rate	36.87	SURE(ResNet-32)
Few-Shot Image Classification	CIFAR-100-LT (ρ=10)	Error Rate	26.76	SURE(ResNet-32)
Few-Shot Image Classification	CIFAR-10-LT (ρ=50)	Error Rate	9.78	SURE(ResNet-32)
Few-Shot Image Classification	CIFAR-100-LT (ρ=100)	Error Rate	43.66	SURE(ResNet-32)
Few-Shot Image Classification	CIFAR-10-LT (ρ=100)	Error Rate	13.07	SURE(ResNet-32)
Generalized Few-Shot Classification	CIFAR-10-LT (ρ=10)	Error Rate	5.04	SURE(ResNet-32)
Generalized Few-Shot Classification	CIFAR-100-LT (ρ=50)	Error Rate	36.87	SURE(ResNet-32)
Generalized Few-Shot Classification	CIFAR-100-LT (ρ=10)	Error Rate	26.76	SURE(ResNet-32)
Generalized Few-Shot Classification	CIFAR-10-LT (ρ=50)	Error Rate	9.78	SURE(ResNet-32)
Generalized Few-Shot Classification	CIFAR-100-LT (ρ=100)	Error Rate	43.66	SURE(ResNet-32)
Generalized Few-Shot Classification	CIFAR-10-LT (ρ=100)	Error Rate	13.07	SURE(ResNet-32)
Long-tail Learning	CIFAR-10-LT (ρ=10)	Error Rate	5.04	SURE(ResNet-32)
Long-tail Learning	CIFAR-100-LT (ρ=50)	Error Rate	36.87	SURE(ResNet-32)
Long-tail Learning	CIFAR-100-LT (ρ=10)	Error Rate	26.76	SURE(ResNet-32)
Long-tail Learning	CIFAR-10-LT (ρ=50)	Error Rate	9.78	SURE(ResNet-32)
Long-tail Learning	CIFAR-100-LT (ρ=100)	Error Rate	43.66	SURE(ResNet-32)
Long-tail Learning	CIFAR-10-LT (ρ=100)	Error Rate	13.07	SURE(ResNet-32)
Generalized Few-Shot Learning	CIFAR-10-LT (ρ=10)	Error Rate	5.04	SURE(ResNet-32)
Generalized Few-Shot Learning	CIFAR-100-LT (ρ=50)	Error Rate	36.87	SURE(ResNet-32)
Generalized Few-Shot Learning	CIFAR-100-LT (ρ=10)	Error Rate	26.76	SURE(ResNet-32)
Generalized Few-Shot Learning	CIFAR-10-LT (ρ=50)	Error Rate	9.78	SURE(ResNet-32)
Generalized Few-Shot Learning	CIFAR-100-LT (ρ=100)	Error Rate	43.66	SURE(ResNet-32)
Generalized Few-Shot Learning	CIFAR-10-LT (ρ=100)	Error Rate	13.07	SURE(ResNet-32)

SURE: SUrvey REcipes for building reliable and robust deep networks

Abstract

Results

Related Papers

SURE: SUrvey REcipes for building reliable and robust deep networks

Abstract

Results

Related Papers