Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Human benchmark

Reported on 7 benchmarks across 3 tasks · 1 paper · 6 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.
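The exact-name matching described in the note can be illustrated with a minimal sketch. The record layout, the field names, and the `group_by_exact_name` helper are assumptions made for illustration, not the site's actual schema or code; the last record is a hypothetical placeholder, not a reported result.

```python
from collections import defaultdict

# Illustrative records. The first two use values listed on this page;
# the third is a HYPOTHETICAL entry that differs only in letter case.
results = [
    {"model": "Human benchmark", "benchmark": "CheGeKa", "accuracy": 64.5},
    {"model": "Human benchmark", "benchmark": "MultiQ", "accuracy": 91.0},
    {"model": "human benchmark", "benchmark": "ExampleBench", "accuracy": 0.0},
]


def group_by_exact_name(records):
    """Group records by the exact model-name string (hypothetical helper).

    No normalization is applied, so names that differ in case or spacing
    land in separate groups, while distinct model variants that share a
    name land in the same group.
    """
    groups = defaultdict(list)
    for record in records:
        groups[record["model"]].append(record)
    return dict(groups)


groups = group_by_exact_name(results)
for name, records in groups.items():
    print(name, [r["benchmark"] for r in records])
```

Under this kind of matching, two papers that both report a model called "Human benchmark" would be merged into one group even if they measured different annotator pools, which is the caveat the note raises.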

Natural Language Processing · 3 results

  • Question Answering on CheGeKa
    Accuracy · 2022-10-23
    64.5
    SOTA
    TAPE: Assessing Few-shot Russian Language Understanding (arXiv:2210.12813)
  • Question Answering on MultiQ
    Accuracy · 2022-10-23
    91
    SOTA
    TAPE: Assessing Few-shot Russian Language Understanding (arXiv:2210.12813)
  • Question Answering on RuOpenBookQA
    Accuracy · 2022-10-23
    86.5
    SOTA
    TAPE: Assessing Few-shot Russian Language Understanding (arXiv:2210.12813)

Miscellaneous · 2 results

  • Ethics on Ethics (per ethics)
    Accuracy · 2022-10-23
    67.6
    SOTA
    TAPE: Assessing Few-shot Russian Language Understanding (arXiv:2210.12813)
  • Ethics on Ethics
    Accuracy · 2022-10-23
    52.9
    best: 68.6 (RuGPT-3 Large)
    TAPE: Assessing Few-shot Russian Language Understanding (arXiv:2210.12813)

Methodology · 2 results

  • Logical Reasoning on Winograd Automatic
    Accuracy · 2022-10-23
    87
    SOTA
    TAPE: Assessing Few-shot Russian Language Understanding (arXiv:2210.12813)
  • Logical Reasoning on RuWorldTree
    Accuracy · 2022-10-23
    83.7
    SOTA
    TAPE: Assessing Few-shot Russian Language Understanding (arXiv:2210.12813)