Metric: Accuracy (higher is better)
| # | Model↕ | Accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Human benchmark | 67.6 | No | TAPE: Assessing Few-shot Russian Language Unders... | 2022-10-23 | Code |
| 2 | RuGPT-3 Small | 60.9 | No | TAPE: Assessing Few-shot Russian Language Unders... | 2022-10-23 | Code |
| 3 | RuGPT-3 Large | 44.9 | No | TAPE: Assessing Few-shot Russian Language Unders... | 2022-10-23 | Code |
| 4 | RuGPT-3 Medium | 44.1 | No | TAPE: Assessing Few-shot Russian Language Unders... | 2022-10-23 | Code |