Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Common Sense Reasoning on RuCoS

Metric: EM (higher is better)
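As a rough illustration of the metric, the sketch below computes exact match (EM) the way ReCoRD-style reading comprehension tasks usually do: a prediction scores 1 if it equals any gold answer after light normalization, and the scores are averaged over examples. The function names and the normalization (lowercasing, whitespace collapsing) are assumptions for illustration, not the official RuCoS scorer.

```python
from typing import List, Sequence


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace (assumed normalization, not the official one)."""
    return " ".join(text.lower().split())


def exact_match(prediction: str, gold_answers: Sequence[str]) -> float:
    """Return 1.0 if the prediction equals any gold answer after normalization, else 0.0."""
    pred = normalize(prediction)
    return float(any(pred == normalize(gold) for gold in gold_answers))


def mean_exact_match(predictions: List[str], references: List[Sequence[str]]) -> float:
    """Average the per-example EM; higher is better, with a maximum of 1.0."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not predictions:
        return 0.0
    scores = [exact_match(p, golds) for p, golds in zip(predictions, references)]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical toy data: two examples, one answered correctly.
    preds = ["Москва", "Париж"]
    golds = [["москва", "г. Москва"], ["Лондон"]]
    print(mean_exact_match(preds, golds))
```

Under this definition, the scores in the table below read as the fraction of test queries answered exactly right.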


Results

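| # | Model | EM | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Golden Transformer | 0.924 | No | - | - | - |
| 2 | Human Benchmark | 0.89 | No | RussianSuperGLUE: A Russian Language Understandi... | 2020-10-29 | Code |
| 3 | YaLM 1.0B few-shot | 0.859 | No | - | - | - |
| 4 | ruT5-large-finetune | 0.764 | No | - | - | - |
| 5 | ruT5-base-finetune | 0.752 | No | - | - | - |
| 6 | ruBert-base finetune | 0.716 | No | - | - | - |
| 7 | ruRoberta-large finetune | 0.716 | No | - | - | - |
| 8 | RuGPT3XL few-shot | 0.665 | No | - | - | - |
| 9 | ruBert-large finetune | 0.658 | No | - | - | - |
| 10 | MT5 Large | 0.562 | No | mT5: A massively multilingual pre-trained text-t... | 2020-10-22 | Code |
| 11 | SBERT_Large | 0.351 | No | - | - | - |
| 12 | SBERT_Large_mt_ru_finetuning | 0.347 | No | - | - | - |
| 13 | RuBERT plain | 0.314 | No | - | - | - |
| 14 | Multilingual Bert | 0.29 | No | - | - | - |
| 15 | heuristic majority | 0.257 | No | Unreasonable Effectiveness of Rule-Based Heurist... | 2021-05-03 | - |
| 16 | Baseline TF-IDF1.1 | 0.252 | No | RussianSuperGLUE: A Russian Language Understandi... | 2020-10-29 | Code |
| 17 | Random weighted | 0.247 | No | Unreasonable Effectiveness of Rule-Based Heurist... | 2021-05-03 | - |
| 18 | majority_class | 0.247 | No | Unreasonable Effectiveness of Rule-Based Heurist... | 2021-05-03 | - |
| 19 | RuGPT3Medium | 0.224 | No | - | - | - |
| 20 | RuBERT conversational | 0.218 | No | - | - | - |
| 21 | RuGPT3Small | 0.204 | No | - | - | - |
| 22 | RuGPT3Large | 0.202 | No | - | - | - |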