Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SBERT_Large_mt_ru_finetuning

Reported on 12 benchmarks across 5 tasks

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing: 12 results

| Task | Dataset | Metric | Score | Best (entry) |
|---|---|---|---|---|
| Reading Comprehension | MuSeRC | Average F1 | 0.642 | 0.941 (Golden Transformer) |
| Reading Comprehension | MuSeRC | EM | 0.319 | 0.819 (Golden Transformer) |
| Question Answering | DaNetQA | Accuracy | 0.697 | 0.917 (Golden Transformer) |
| Common Sense Reasoning | RWSD | Accuracy | 0.675 | 0.84 (Human Benchmark) |
| Common Sense Reasoning | PARus | Accuracy | 0.498 | 0.982 (Human Benchmark) |
| Common Sense Reasoning | RuCoS | Average F1 | 0.35 | 0.93 (Human Benchmark) |
| Common Sense Reasoning | RuCoS | EM | 0.347 | 0.924 (Golden Transformer) |
| Word Sense Disambiguation | RUSSE | Accuracy | 0.657 | 0.805 (Human Benchmark) |
| Natural Language Inference | RCB | Accuracy | 0.486 | 0.702 (Human Benchmark) |
| Natural Language Inference | RCB | Average F1 | 0.351 | 0.68 (Human Benchmark) |
| Natural Language Inference | LiDiRus | MCC | 0.218 | 0.626 (Human Benchmark) |
| Natural Language Inference | TERRa | Accuracy | 0.637 | 0.92 (Human Benchmark) |
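
The distance between this model and the best reported entry on each benchmark follows directly from the figures listed above. The snippet below is a minimal sketch of that arithmetic in Python. The scores are copied from this page; the names `RESULTS` and `gap_to_best` are illustrative and not part of any site API.

```python
# Each row: (task, dataset, metric, model score, best score, best entry).
# Values are copied from the results listed on this page.
RESULTS = [
    ("Reading Comprehension", "MuSeRC", "Average F1", 0.642, 0.941, "Golden Transformer"),
    ("Reading Comprehension", "MuSeRC", "EM", 0.319, 0.819, "Golden Transformer"),
    ("Question Answering", "DaNetQA", "Accuracy", 0.697, 0.917, "Golden Transformer"),
    ("Common Sense Reasoning", "RWSD", "Accuracy", 0.675, 0.84, "Human Benchmark"),
    ("Common Sense Reasoning", "PARus", "Accuracy", 0.498, 0.982, "Human Benchmark"),
    ("Common Sense Reasoning", "RuCoS", "Average F1", 0.35, 0.93, "Human Benchmark"),
    ("Common Sense Reasoning", "RuCoS", "EM", 0.347, 0.924, "Golden Transformer"),
    ("Word Sense Disambiguation", "RUSSE", "Accuracy", 0.657, 0.805, "Human Benchmark"),
    ("Natural Language Inference", "RCB", "Accuracy", 0.486, 0.702, "Human Benchmark"),
    ("Natural Language Inference", "RCB", "Average F1", 0.351, 0.68, "Human Benchmark"),
    ("Natural Language Inference", "LiDiRus", "MCC", 0.218, 0.626, "Human Benchmark"),
    ("Natural Language Inference", "TERRa", "Accuracy", 0.637, 0.92, "Human Benchmark"),
]


def gap_to_best(results):
    """Return (dataset, metric, gap) rows, largest gap first.

    The gap is the best reported score minus this model's score,
    rounded to three decimals to hide floating-point noise.
    """
    rows = [
        (dataset, metric, round(best - score, 3))
        for _task, dataset, metric, score, best, _entry in results
    ]
    return sorted(rows, key=lambda row: row[2], reverse=True)


gaps = gap_to_best(RESULTS)
for dataset, metric, gap in gaps:
    print(f"{dataset:8s} {metric:11s} {gap:.3f}")
```

Because the metrics differ (Accuracy, F1, EM, MCC), the gaps are comparable only within a metric, not across the whole list.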