Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

resnet8x4 (T: resnet32x4 S: resnet8x4)

Reported on 1 benchmark across 1 task · 7 papers · 4 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.
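To illustrate what exact-name matching implies, here is a minimal sketch in Python. It is not the site's actual code: the function name `group_by_exact_name`, the row layout, and the second model name (which differs only in letter case) are assumptions made up for illustration, and its placeholder score of 0.0 is not a real result.

```python
from collections import defaultdict

# Rows are (model name, Top-1 accuracy). The first two names and scores appear
# on this page; the third name is hypothetical and its 0.0 score is a placeholder.
rows = [
    ("resnet8x4 (T: resnet32x4 S: resnet8x4)", 76.68),
    ("resnet8x4 (T: resnet32x4 S: resnet8x4)", 73.33),
    ("ResNet8x4 (T: resnet32x4 S: resnet8x4)", 0.0),  # hypothetical variant spelling
]

def group_by_exact_name(rows):
    """Group scores by the exact model-name string, with no normalization."""
    groups = defaultdict(list)
    for name, score in rows:
        # Case, spacing, and punctuation all matter under exact matching.
        groups[name].append(score)
    return dict(groups)

groups = group_by_exact_name(rows)
for name, scores in groups.items():
    print(f"{name}: {len(scores)} result(s)")
```

Under this assumption, two spellings of the same architecture land on separate pages, and two different architectures that share a name land on the same page, which is the caveat the note describes.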

Natural Language Processing · 7 results

  • Knowledge Distillation on CIFAR-100
    Top-1 Accuracy (%) · 2021-12-01
    76.68
    best: 79.86 (SRD (T:resnet-32x4, S:shufflenet-v2))
    SOTA
    Information Theoretic Representation Distillation · arXiv:2112.00459
  • Knowledge Distillation on CIFAR-100
    Top-1 Accuracy (%) · 2020-12-15
    76.15
    best: 79.86 (SRD (T:resnet-32x4, S:shufflenet-v2))
    SOTA
    Wasserstein Contrastive Representation Distillation · arXiv:2012.08674
  • Knowledge Distillation on CIFAR-100
    Top-1 Accuracy (%) · 2019-10-23
    75.51
    best: 79.86 (SRD (T:resnet-32x4, S:shufflenet-v2))
    SOTA
    Contrastive Representation Distillation · arXiv:1910.10699
  • Knowledge Distillation on CIFAR-100
    Top-1 Accuracy (%) · 2015-03-09
    73.33
    best: 79.86 (SRD (T:resnet-32x4, S:shufflenet-v2))
    SOTA
    Distilling the Knowledge in a Neural Network · arXiv:1503.02531
  • Knowledge Distillation on CIFAR-100
    Top-1 Accuracy (%) · 2023-10-05
    77.5
    best: 79.86 (SRD (T:resnet-32x4, S:shufflenet-v2))
    LumiNet: The Bright Side of Perceptual Knowledge Distillation · arXiv:2310.03669
  • Knowledge Distillation on CIFAR-100
    Top-1 Accuracy (%) · 2022-05-21
    76.31
    best: 79.86 (SRD (T:resnet-32x4, S:shufflenet-v2))
    Knowledge Distillation from A Stronger Teacher · arXiv:2205.10536
  • Knowledge Distillation on CIFAR-100
    Top-1 Accuracy (%) · 2021-04-19
    75.63
    best: 79.86 (SRD (T:resnet-32x4, S:shufflenet-v2))
    Distilling Knowledge via Knowledge Review · arXiv:2104.09044