Zheng Li, YuXuan Li, Penghai Zhao, RenJie Song, Xiang Li, Jian Yang
Diffusion models have recently achieved astonishing performance in generating high-fidelity photo-realistic images. Given their huge success, it is still unclear whether synthetic images are applicable for knowledge distillation when real images are unavailable. In this paper, we extensively study whether and how synthetic images produced from state-of-the-art diffusion models can be used for knowledge distillation without access to real images, and obtain three key conclusions: (1) synthetic data from diffusion models can easily lead to state-of-the-art performance among existing synthesis-based distillation methods, (2) low-fidelity synthetic images are better teaching materials, and (3) relatively weak classifiers are better teachers. Code is available at https://github.com/zhengli97/DM-KD.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Few-Shot Learning | Stanford Cars | 12-shot Accuracy | 83.9 | Real-Guidance + CAL |
| Few-Shot Learning | Stanford Cars | 16-shot Accuracy | 88.3 | Real-Guidance + CAL |
| Few-Shot Learning | Stanford Cars | 4-shot Accuracy | 44.3 | Real-Guidance + CAL |
| Few-Shot Learning | Stanford Cars | 8-shot Accuracy | 73.1 | Real-Guidance + CAL |
| Few-Shot Learning | FGVC Aircraft | 12-shot Accuracy | 65.8 | Real-Guidance + CAL |
| Few-Shot Learning | FGVC Aircraft | 16-shot Accuracy | 72.5 | Real-Guidance + CAL |
| Few-Shot Learning | FGVC Aircraft | 4-shot Accuracy | 34.5 | Real-Guidance + CAL |
| Few-Shot Learning | FGVC Aircraft | 8-shot Accuracy | 54.6 | Real-Guidance + CAL |
| Few-Shot Learning | FGVC Aircraft | Harmonic mean | 34.5 | Real-Guidance + CAL |
| Few-Shot Learning | DTD | 12-shot Accuracy | 54.5 | Real-Guidance + CAL |
| Few-Shot Learning | DTD | 16-shot Accuracy | 57.4 | Real-Guidance + CAL |
| Few-Shot Learning | DTD | 4-shot Accuracy | 41.5 | Real-Guidance + CAL |
| Few-Shot Learning | DTD | 8-shot Accuracy | 50.6 | Real-Guidance + CAL |
| Meta-Learning | Stanford Cars | 12-shot Accuracy | 83.9 | Real-Guidance + CAL |
| Meta-Learning | Stanford Cars | 16-shot Accuracy | 88.3 | Real-Guidance + CAL |
| Meta-Learning | Stanford Cars | 4-shot Accuracy | 44.3 | Real-Guidance + CAL |
| Meta-Learning | Stanford Cars | 8-shot Accuracy | 73.1 | Real-Guidance + CAL |
| Meta-Learning | FGVC Aircraft | 12-shot Accuracy | 65.8 | Real-Guidance + CAL |
| Meta-Learning | FGVC Aircraft | 16-shot Accuracy | 72.5 | Real-Guidance + CAL |
| Meta-Learning | FGVC Aircraft | 4-shot Accuracy | 34.5 | Real-Guidance + CAL |
| Meta-Learning | FGVC Aircraft | 8-shot Accuracy | 54.6 | Real-Guidance + CAL |
| Meta-Learning | FGVC Aircraft | Harmonic mean | 34.5 | Real-Guidance + CAL |
| Meta-Learning | DTD | 12-shot Accuracy | 54.5 | Real-Guidance + CAL |
| Meta-Learning | DTD | 16-shot Accuracy | 57.4 | Real-Guidance + CAL |
| Meta-Learning | DTD | 4-shot Accuracy | 41.5 | Real-Guidance + CAL |
| Meta-Learning | DTD | 8-shot Accuracy | 50.6 | Real-Guidance + CAL |
| Classification | FGVC Aircraft | OOD Accuracy (%) | 17.7 | CAL + Real-Guidance |
| Classification | FGVC Aircraft | Top-1 Accuracy (%) | 71.7 | CAL + Real-Guidance |