Qwen2idae-16x14B (4-shot)

Reported on 5 benchmarks across 5 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing2 results

Question AnsweringonMATH
Accuracy· 2024-01-05
29.9
best: 89.7 (Gemini 2.0 Flash Experimental)
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks arXiv:2401.02731
Code GenerationonMBPP
Accuracy· 2024-01-05
48.6
best: 96.6 (EG-CFG (DeepSeek-V3-0324))
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks arXiv:2401.02731

Knowledge Base2 results

Mathematical Question AnsweringonMATH
Accuracy· 2024-01-05
29.9
best: 89.7 (Gemini 2.0 Flash Experimental)
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks arXiv:2401.02731
Mathematical ReasoningonMATH
Accuracy· 2024-01-05
29.9
best: 89.7 (Gemini 2.0 Flash Experimental)
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks arXiv:2401.02731

Reasoning1 result

Math Word Problem SolvingonMATH
Accuracy· 2024-01-05
29.9
best: 89.7 (Gemini 2.0 Flash Experimental)
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks arXiv:2401.02731