Christoph Reinders, Frederik Schubert, Bodo Rosenhahn
Deep convolutional neural networks require large amounts of labeled data samples. For many real-world applications, this is a major limitation which is commonly treated by augmentation methods. In this work, we address the problem of learning deep neural networks on small datasets. Our proposed architecture called ChimeraMix learns a data augmentation by generating compositions of instances. The generative model encodes images in pairs, combines the features guided by a mask, and creates new samples. For evaluation, all methods are trained from scratch without any additional data. Several experiments on benchmark datasets, e.g. ciFAIR-10, STL-10, and ciFAIR-100, demonstrate the superior performance of ChimeraMix compared to current state-of-the-art methods for classification on small datasets.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Classification | CIFAR-10, 500 Labels | Accuracy (%) | 70.09 | ChimeraMix+AutoAugment |
| Image Classification | CIFAR-10, 500 Labels | Accuracy (%) | 67.3 | ChimeraMix |
| Image Classification | CIFAR-10, 1000 Labels | Accuracy (%) | 76.76 | ChimeraMix+AutoAugment |
| Image Classification | CIFAR-10, 1000 Labels | Accuracy (%) | 74.96 | ChimeraMix |
| Image Classification | CIFAR-100, 1000 Labels | Accuracy | 35.02 | ChimeraMix+AutoAugment |
| Image Classification | CIFAR-100, 1000 Labels | Accuracy | 32.72 | ChimeraMix |
| Image Classification | CIFAR-10, 100 Labels | Accuracy (%) | 49.75 | ChimeraMix+AutoAugment |
| Image Classification | CIFAR-10, 100 Labels | Accuracy (%) | 47.6 | ChimeraMix |
| Image Classification | ciFAIR-10 50 samples per class | Accuracy | 70.09 | ChimeraMix+AutoAugment |
| Image Classification | ciFAIR-10 50 samples per class | Accuracy | 67.3 | ChimeraMix |