TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Common Sense Reasoning/RuCoS

Common Sense Reasoning on RuCoS

Metric: Average F1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Average F1▼Extra DataPaperDate↕Code
1Human Benchmark0.93NoRussianSuperGLUE: A Russian Language Understandi...2020-10-29Code
2Golden Transformer0.92No---
3YaLM 1.0B few-shot0.86No---
4ruT5-large-finetune0.81No---
5ruT5-base-finetune0.79No---
6ruBert-base finetune0.74No---
7ruRoberta-large finetune0.73No---
8ruBert-large finetune0.68No---
9RuGPT3XL few-shot0.67No---
10MT5 Large0.57NomT5: A massively multilingual pre-trained text-t...2020-10-22Code
11SBERT_Large0.36No---
12SBERT_Large_mt_ru_finetuning0.35No---
13RuBERT plain0.32No---
14Multilingual Bert0.29No---
15heuristic majority0.26NoUnreasonable Effectiveness of Rule-Based Heurist...2021-05-03-
16Baseline TF-IDF1.10.26NoRussianSuperGLUE: A Russian Language Understandi...2020-10-29Code
17Random weighted0.25NoUnreasonable Effectiveness of Rule-Based Heurist...2021-05-03-
18majority_class0.25NoUnreasonable Effectiveness of Rule-Based Heurist...2021-05-03-
19RuGPT3Medium0.23No---
20RuBERT conversational0.22No---
21RuGPT3Small0.21No---
22RuGPT3Large0.21No---