Qwen2idae-16x14B (4-shot)
Reported on 5 benchmarks across 5 tasks · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing2 results
- Accuracy· 2024-01-0529.9best: 89.7 (Gemini 2.0 Flash Experimental)
- Accuracy· 2024-01-0548.6best: 96.6 (EG-CFG (DeepSeek-V3-0324))
Knowledge Base2 results
- Accuracy· 2024-01-0529.9best: 89.7 (Gemini 2.0 Flash Experimental)
- Accuracy· 2024-01-0529.9best: 89.7 (Gemini 2.0 Flash Experimental)
Reasoning1 result
- Accuracy· 2024-01-0529.9best: 89.7 (Gemini 2.0 Flash Experimental)