Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Common Sense Reasoning
/
RuCoS
Common Sense Reasoning on RuCoS
Metric: EM (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
EM (best first)
EM (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
EM
▼
Extra Data
Paper
Date
↕
Code
1
Golden Transformer
0.924
No
-
-
-
2
Human Benchmark
0.89
No
RussianSuperGLUE: A Russian Language Understandi...
2020-10-29
Code
3
YaLM 1.0B few-shot
0.859
No
-
-
-
4
ruT5-large-finetune
0.764
No
-
-
-
5
ruT5-base-finetune
0.752
No
-
-
-
6
ruBert-base finetune
0.716
No
-
-
-
7
ruRoberta-large finetune
0.716
No
-
-
-
8
RuGPT3XL few-shot
0.665
No
-
-
-
9
ruBert-large finetune
0.658
No
-
-
-
10
MT5 Large
0.562
No
mT5: A massively multilingual pre-trained text-t...
2020-10-22
Code
11
SBERT_Large
0.351
No
-
-
-
12
SBERT_Large_mt_ru_finetuning
0.347
No
-
-
-
13
RuBERT plain
0.314
No
-
-
-
14
Multilingual Bert
0.29
No
-
-
-
15
heuristic majority
0.257
No
Unreasonable Effectiveness of Rule-Based Heurist...
2021-05-03
-
16
Baseline TF-IDF1.1
0.252
No
RussianSuperGLUE: A Russian Language Understandi...
2020-10-29
Code
17
Random weighted
0.247
No
Unreasonable Effectiveness of Rule-Based Heurist...
2021-05-03
-
18
majority_class
0.247
No
Unreasonable Effectiveness of Rule-Based Heurist...
2021-05-03
-
19
RuGPT3Medium
0.224
No
-
-
-
20
RuBERT conversational
0.218
No
-
-
-
21
RuGPT3Small
0.204
No
-
-
-
22
RuGPT3Large
0.202
No
-
-
-
#1
Golden Transformer
0.924
EM
No paper
#2
Human Benchmark
SOTA
0.89
EM
· 2020-10-29
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
Code
#3
YaLM 1.0B few-shot
0.859
EM
No paper
#4
ruT5-large-finetune
0.764
EM
No paper
#5
ruT5-base-finetune
0.752
EM
No paper
#6
ruBert-base finetune
0.716
EM
No paper
#7
ruRoberta-large finetune
0.716
EM
No paper
#8
RuGPT3XL few-shot
0.665
EM
No paper
#9
ruBert-large finetune
0.658
EM
No paper
#10
MT5 Large
SOTA
0.562
EM
· 2020-10-22
mT5: A massively multilingual pre-trained text-to-text transformer
Code
#11
SBERT_Large
0.351
EM
No paper
#12
SBERT_Large_mt_ru_finetuning
0.347
EM
No paper
#13
RuBERT plain
0.314
EM
No paper
#14
Multilingual Bert
0.29
EM
No paper
#15
heuristic majority
0.257
EM
· 2021-05-03
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks
#16
Baseline TF-IDF1.1
0.252
EM
· 2020-10-29
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
Code
#17
Random weighted
0.247
EM
· 2021-05-03
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks
#18
majority_class
0.247
EM
· 2021-05-03
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks
#19
RuGPT3Medium
0.224
EM
No paper
#20
RuBERT conversational
0.218
EM
No paper
#21
RuGPT3Small
0.204
EM
No paper
#22
RuGPT3Large
0.202
EM
No paper