Data-free Knowledge Distillation on Wiki-40B

Metric: Perplexity (lower is better)

# | Model | Perplexity | Extra Data | Paper | Date | Code
1 | OutEffHop-Bert_base | 6.209 | No | Outlier-Efficient Hopfield Layers for Large Tran... | 2024-04-04 | Code