DeepSeek-r1
Reported on 1 benchmark across 1 task · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Knowledge Base · 1 result
- Acc · 2025-01-22 · SOTA · 79.8 · best: 94.4 (Xolver)