TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Knowledge Distillation/CIFAR-100

Knowledge Distillation on CIFAR-100

Metric: Top-1 Accuracy (%) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Top-1 Accuracy (%)▼Extra DataPaperDate↕Code
1SRD (T:resnet-32x4, S:shufflenet-v2)79.86NoUnderstanding the Role of the Projector in Knowl...2023-03-20Code
2shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)78.76NoLogit Standardization in Knowledge Distillation2024-03-03Code
3MV-MR (T: CLIP/ViT-B-16 S: resnet50)78.6NoMV-MR: multi-views and multi-representations for...2023-03-21Code
4resnet8x4 (T: resnet32x4 S: resnet8x4)78.28NoLogit Standardization in Knowledge Distillation2024-03-03Code
5resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])78.08NoKnowledge Distillation with the Reused Teacher C...2022-03-26Code
6ReviewKD++(T:resnet-32x4, S:shufflenet-v2)77.93NoImproving Knowledge Distillation via Regularizin...2023-05-26Code
7ReviewKD++(T:resnet-32x4, S:shufflenet-v1)77.68NoImproving Knowledge Distillation via Regularizin...2023-05-26Code
8resnet8x4 (T: resnet32x4 S: resnet8x4)77.5NoLumiNet: The Bright Side of Perceptual Knowledge...2023-10-05Code
9resnet8x4 (T: resnet32x4 S: resnet8x4)76.68NoInformation Theoretic Representation Distillation2021-12-01Code
10resnet8x4 (T: resnet32x4 S: resnet8x4)76.31NoKnowledge Distillation from A Stronger Teacher2022-05-21Code
11DKD++(T:resnet-32x4, S:resnet-8x4)76.28NoImproving Knowledge Distillation via Regularizin...2023-05-26Code
12resnet8x4 (T: resnet32x4 S: resnet8x4)76.15NoWasserstein Contrastive Representation Distillat...2020-12-15-
13ReviewKD++(T:WRN-40-2, S:WRN-40-1)75.66NoImproving Knowledge Distillation via Regularizin...2023-05-26Code
14resnet8x4 (T: resnet32x4 S: resnet8x4)75.63NoDistilling Knowledge via Knowledge Review2021-04-19Code
15resnet8x4 (T: resnet32x4 S: resnet8x4)75.51NoContrastive Representation Distillation2019-10-23Code
16vgg8 (T:vgg13 S:vgg8)74.93NoInformation Theoretic Representation Distillation2021-12-01Code
17vgg8 (T:vgg13 S:vgg8)74.84NoDistilling Knowledge via Knowledge Review2021-04-19Code
18vgg8 (T:vgg13 S:vgg8)74.72NoWasserstein Contrastive Representation Distillat...2020-12-15-
19vgg8 (T:vgg13 S:vgg8)74.29NoContrastive Representation Distillation2019-10-23Code
20resnet8x4 (T: resnet32x4 S: resnet8x4)73.33NoDistilling the Knowledge in a Neural Network2015-03-09Code
21vgg8 (T:vgg13 S:vgg8)72.98NoDistilling the Knowledge in a Neural Network2015-03-09Code
22KD++(T:resnet56, S:resnet20)72.53NoImproving Knowledge Distillation via Regularizin...2023-05-26Code
23resnet110 (T:resnet110 S:resnet20)71.99NoInformation Theoretic Representation Distillation2021-12-01Code
24resnet110 (T:resnet110 S:resnet20)71.88NoWasserstein Contrastive Representation Distillat...2020-12-15-
25resnet110 (T:resnet110 S:resnet20)71.56NoContrastive Representation Distillation2019-10-23Code
26DKD++(T:resnet50, S:mobilenetv2)70.82NoImproving Knowledge Distillation via Regularizin...2023-05-26Code
27resnet110 (T:resnet110 S:resnet20)70.67NoDistilling the Knowledge in a Neural Network2015-03-09Code