Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

ruRoberta-large finetune

Reported on 12 benchmark-metric pairs (9 datasets) across 5 tasks

Note: results are matched by exact model name. Different papers may use the same name for different model variants.
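To illustrate what exact-name matching implies, here is a minimal sketch. The record layout, the field names, and the differently cased third entry are hypothetical and used only for illustration; they are not the site's actual schema or data. The two matched scores are taken from the results on this page.

```python
# Hypothetical result records. The field names are assumptions, and the
# third entry is an invented name variant with no score attached.
records = [
    {"model": "ruRoberta-large finetune", "benchmark": "DaNetQA", "metric": "Accuracy", "value": 0.82},
    {"model": "ruRoberta-large finetune", "benchmark": "TERRa", "metric": "Accuracy", "value": 0.801},
    {"model": "ruRoberta-large Finetune", "benchmark": "TERRa", "metric": "Accuracy", "value": None},
]


def results_for(model_name, records):
    """Return the records whose model name equals model_name exactly.

    The comparison is case- and whitespace-sensitive, so a differently
    spelled variant of the same model is not matched, while two different
    models that share one name would be grouped together.
    """
    return [r for r in records if r["model"] == model_name]


matched = results_for("ruRoberta-large finetune", records)
for r in matched:
    print(r["benchmark"], r["metric"], r["value"])
```

Under this kind of matching, results reported under the same string end up on the same page even if the underlying checkpoints differ, so it is worth checking the source paper for each entry.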

Natural Language Processing (12 results)

| Task | Benchmark | Metric | Score | Best | Best entry |
|---|---|---|---|---|---|
| Reading Comprehension | MuSeRC | Average F1 | 0.83 | 0.941 | Golden Transformer |
| Reading Comprehension | MuSeRC | EM | 0.561 | 0.819 | Golden Transformer |
| Question Answering | DaNetQA | Accuracy | 0.82 | 0.917 | Golden Transformer |
| Common Sense Reasoning | RWSD | Accuracy | 0.571 | 0.84 | Human Benchmark |
| Common Sense Reasoning | PARus | Accuracy | 0.508 | 0.982 | Human Benchmark |
| Common Sense Reasoning | RuCoS | Average F1 | 0.73 | 0.93 | Human Benchmark |
| Common Sense Reasoning | RuCoS | EM | 0.716 | 0.924 | Golden Transformer |
| Word Sense Disambiguation | RUSSE | Accuracy | 0.715 | 0.805 | Human Benchmark |
| Natural Language Inference | RCB | Accuracy | 0.518 | 0.702 | Human Benchmark |
| Natural Language Inference | RCB | Average F1 | 0.357 | 0.68 | Human Benchmark |
| Natural Language Inference | LiDiRus | MCC | 0.339 | 0.626 | Human Benchmark |
| Natural Language Inference | TERRa | Accuracy | 0.801 | 0.92 | Human Benchmark |
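
The reported scores can be compared against the best listed score per benchmark with a short script. This is a minimal sketch: the numbers are copied from the results on this page, while the tuple layout and the `gaps_to_best` helper are illustrative assumptions and not part of any site API.

```python
# (task, benchmark, metric, score, best) copied from the results on this page.
RESULTS = [
    ("Reading Comprehension", "MuSeRC", "Average F1", 0.83, 0.941),
    ("Reading Comprehension", "MuSeRC", "EM", 0.561, 0.819),
    ("Question Answering", "DaNetQA", "Accuracy", 0.82, 0.917),
    ("Common Sense Reasoning", "RWSD", "Accuracy", 0.571, 0.84),
    ("Common Sense Reasoning", "PARus", "Accuracy", 0.508, 0.982),
    ("Common Sense Reasoning", "RuCoS", "Average F1", 0.73, 0.93),
    ("Common Sense Reasoning", "RuCoS", "EM", 0.716, 0.924),
    ("Word Sense Disambiguation", "RUSSE", "Accuracy", 0.715, 0.805),
    ("Natural Language Inference", "RCB", "Accuracy", 0.518, 0.702),
    ("Natural Language Inference", "RCB", "Average F1", 0.357, 0.68),
    ("Natural Language Inference", "LiDiRus", "MCC", 0.339, 0.626),
    ("Natural Language Inference", "TERRa", "Accuracy", 0.801, 0.92),
]


def gaps_to_best(results):
    """Return (benchmark, metric, gap) tuples sorted by gap, largest first.

    The gap is the best listed score minus this model's score, rounded to
    three decimals to hide floating-point noise.
    """
    rows = [
        (bench, metric, round(best - score, 3))
        for _task, bench, metric, score, best in results
    ]
    return sorted(rows, key=lambda row: row[2], reverse=True)


for bench, metric, gap in gaps_to_best(RESULTS):
    print(f"{bench:8s} {metric:11s} gap to best: {gap:.3f}")
```

Note that the metrics differ across benchmarks (accuracy, F1, EM, MCC), so the gaps are only directly comparable within one metric.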