Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

T5-base

Reported on 89 benchmarks across 8 tasks · 2 papers · 36 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing · 87 results

  • Question Answering on KILT: ELI5
    Rouge-L · 2020-09-04
    19.08
    best: 27.13 (RBG)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: TriviaQA
    EM · 2020-09-04
    18.11
    best: 76.27 (Re2G)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: TriviaQA
    F1 · 2020-09-04
    27.83
    best: 81.4 (Re2G)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: Natural Questions
    EM · 2020-09-04
    19.6
    best: 53.74 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: Natural Questions
    F1 · 2020-09-04
    27.73
    best: 62.24 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: HotpotQA
    EM · 2020-09-04
    12.64
    best: 40.46 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: HotpotQA
    F1 · 2020-09-04
    19.57
    best: 51.44 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: ELI5
    ROUGE-L · 2020-09-04
    19.08
    best: 24.53 (somebody)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-WIKI
    Accuracy · 2020-09-04
    47.13
    best: 87.44 (GENRE)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-WIKI
    KILT-AC · 2020-09-04
    47.13
    best: 87.44 (GENRE)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-WIKI
    R-Prec · 2020-09-04
    47.13
    best: 88.12 (chriskuei)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-WIKI
    Recall@5 · 2020-09-04
    47.13
    best: 95.62 (chriskuei)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: AIDA-YAGO2
    Accuracy · 2020-09-04
    74.05
    best: 89.85 (GENRE)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: AIDA-YAGO2
    KILT-AC · 2020-09-04
    74.05
    best: 89.85 (GENRE)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: AIDA-YAGO2
    R-Prec · 2020-09-04
    74.05
    best: 89.98 (chriskuei)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: AIDA-YAGO2
    Recall@5 · 2020-09-04
    74.05
    best: 94.85 (chriskuei)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-CWEB
    Accuracy · 2020-09-04
    49.29
    best: 71.22 (GENRE)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-CWEB
    KILT-AC · 2020-09-04
    49.29
    best: 71.22 (GENRE)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-CWEB
    R-Prec · 2020-09-04
    49.29
    best: 71.22 (GENRE)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Entity Linking on KILT: WNED-CWEB
    Recall@5 · 2020-09-04
    49.29
    best: 81.76 (BLINK)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Slot Filling on KILT: T-REx
    Accuracy · 2020-09-04
    43.56
    best: 87.68 (Re2G)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Slot Filling on KILT: T-REx
    F1 · 2020-09-04
    50.61
    best: 89.93 (Re2G)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Slot Filling on KILT: Zero Shot RE
    Accuracy · 2020-09-04
    9.02
    best: 74.63 (single ngram)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Slot Filling on KILT: Zero Shot RE
    F1 · 2020-09-04
    13.52
    best: 79.66 (single ngram)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: TriviaQA
    EM · 2020-09-04
    18.11
    best: 76.27 (Re2G)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: TriviaQA
    F1 · 2020-09-04
    27.83
    best: 81.4 (Re2G)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: Natural Questions
    EM · 2020-09-04
    19.6
    best: 53.74 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: Natural Questions
    F1 · 2020-09-04
    27.73
    best: 62.24 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: HotpotQA
    EM · 2020-09-04
    12.64
    best: 40.46 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: HotpotQA
    F1 · 2020-09-04
    19.57
    best: 51.44 (intersect)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: ELI5
    F1 · 2020-09-04
    16.1
    best: 27.13 (somebody)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Question Answering on KILT: ELI5
    ROUGE-L · 2020-09-04
    19.08
    best: 24.53 (somebody)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Dialog on KILT: Wizard of Wikipedia
    F1 · 2020-09-04
    13.53
    best: 19.19 (Hindsight)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Open-Domain Dialog on KILT: Wizard of Wikipedia
    ROUGE-L · 2020-09-04
    12.4
    best: 17.06 (Hindsight)
    SOTA
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Reading Comprehension on PhotoChat
    Precision · 2019-10-23
    58.2
    best: 63.3 (PaCE)
    SOTA
    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer · arXiv:1910.10683
  • Question Answering on KILT: ELI5
    F1 · 2020-09-04
    16.1
    best: 27.13 (somebody)
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Question Answering on KILT: ELI5
    F1 · 2020-09-04
    16.1
    best: 27.13 (somebody)
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Fact Verification on KILT: FEVER
    Accuracy · 2020-09-04
    76.3
    best: 89.55 (Re2G)
    KILT: a Benchmark for Knowledge Intensive Language Tasks · arXiv:2009.02252
  • Reading Comprehension on PhotoChat
    F1 · 2019-10-23
    58.1
    best: 63.8 (PaCE)
    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer · arXiv:1910.10683
  • Reading Comprehension on PhotoChat
    Recall · 2019-10-23
    57.9
    best: 68 (PaCE)
    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer · arXiv:1910.10683
  • Question Answering on KILT: TriviaQA
    KILT-EM
    0
    best: 57.91 (Re2G)
  • Question Answering on KILT: TriviaQA
    KILT-F1
    0
    best: 61.78 (Re2G)
  • Question Answering on KILT: TriviaQA
    R-Prec
    0
    best: 72.68 (Re2G)
  • Question Answering on KILT: TriviaQA
    Recall@5
    0
    best: 76.36 (intersect)
  • Question Answering on KILT: Natural Questions
    KILT-EM
    0
    best: 43.56 (Re2G)
  • Question Answering on KILT: Natural Questions
    KILT-F1
    0
    best: 49.8 (Re2G)
  • Question Answering on KILT: Natural Questions
    R-Prec
    0
    best: 70.78 (Re2G)
  • Question Answering on KILT: Natural Questions
    Recall@5
    0
    best: 76.63 (Re2G)
  • Question Answering on KILT: HotpotQA
    KILT-EM
    0
    best: 18.06 (intersect)
  • Question Answering on KILT: HotpotQA
    KILT-F1
    0
    best: 21.42 (intersect)
  • Question Answering on KILT: HotpotQA
    R-Prec
    0
    best: 58.83 (intersect)
  • Question Answering on KILT: HotpotQA
    Recall@5
    0
    best: 51.03 (intersect)
  • Question Answering on KILT: ELI5
    KILT-F1
    0
    best: 3 (somebody)
  • Question Answering on KILT: ELI5
    KILT-RL
    0
    best: 2.62 (somebody)
  • Question Answering on KILT: ELI5
    R-Prec
    0
    best: 18.33 (TABi)
  • Question Answering on KILT: ELI5
    Recall@5
    0
    best: 28.21 (TABi)
  • Slot Filling on KILT: T-REx
    KILT-AC
    0
    best: 75.84 (Re2G)
  • Slot Filling on KILT: T-REx
    KILT-F1
    0
    best: 77.05 (Re2G)
  • Slot Filling on KILT: T-REx
    R-Prec
    0
    best: 81.9 (TABi)
  • Slot Filling on KILT: T-REx
    Recall@5
    0
    best: 89.36 (TABi)
  • Slot Filling on KILT: Zero Shot RE
    KILT-AC
    0
    best: 73.2 (single ngram)
  • Slot Filling on KILT: Zero Shot RE
    KILT-F1
    0
    best: 78.12 (single ngram)
  • Slot Filling on KILT: Zero Shot RE
    R-Prec
    0
    best: 98.49 (KGI_1)
  • Slot Filling on KILT: Zero Shot RE
    Recall@5
    0
    best: 99.34 (single ngram)
  • Fact Verification on KILT: FEVER
    KILT-AC
    0
    best: 78.53 (Re2G)
  • Fact Verification on KILT: FEVER
    R-Prec
    0
    best: 88.92 (Re2G)
  • Fact Verification on KILT: FEVER
    Recall@5
    0
    best: 92.52 (Re2G)
  • Open-Domain Question Answering on KILT: TriviaQA
    KILT-EM
    0
    best: 57.91 (Re2G)
  • Open-Domain Question Answering on KILT: TriviaQA
    KILT-F1
    0
    best: 61.78 (Re2G)
  • Open-Domain Question Answering on KILT: TriviaQA
    R-Prec
    0
    best: 72.68 (Re2G)
  • Open-Domain Question Answering on KILT: TriviaQA
    Recall@5
    0
    best: 76.36 (intersect)
  • Open-Domain Question Answering on KILT: Natural Questions
    KILT-EM
    0
    best: 43.56 (Re2G)
  • Open-Domain Question Answering on KILT: Natural Questions
    KILT-F1
    0
    best: 49.8 (Re2G)
  • Open-Domain Question Answering on KILT: Natural Questions
    R-Prec
    0
    best: 70.78 (Re2G)
  • Open-Domain Question Answering on KILT: Natural Questions
    Recall@5
    0
    best: 76.63 (Re2G)
  • Open-Domain Question Answering on KILT: HotpotQA
    KILT-EM
    0
    best: 18.06 (intersect)
  • Open-Domain Question Answering on KILT: HotpotQA
    KILT-F1
    0
    best: 21.42 (intersect)
  • Open-Domain Question Answering on KILT: HotpotQA
    R-Prec
    0
    best: 58.83 (intersect)
  • Open-Domain Question Answering on KILT: HotpotQA
    Recall@5
    0
    best: 51.03 (intersect)
  • Open-Domain Question Answering on KILT: ELI5
    KILT-F1
    0
    best: 3 (somebody)
  • Open-Domain Question Answering on KILT: ELI5
    KILT-RL
    0
    best: 2.62 (somebody)
  • Open-Domain Question Answering on KILT: ELI5
    R-Prec
    0
    best: 18.33 (TABi)
  • Open-Domain Question Answering on KILT: ELI5
    Recall@5
    0
    best: 28.21 (TABi)
  • Open-Domain Dialog on KILT: Wizard of Wikipedia
    KILT-F1
    0
    best: 13.39 (Hindsight)
  • Open-Domain Dialog on KILT: Wizard of Wikipedia
    KILT-RL
    0
    best: 11.92 (Hindsight)
  • Open-Domain Dialog on KILT: Wizard of Wikipedia
    R-Prec
    0
    best: 64.79 (chriskuei)
  • Open-Domain Dialog on KILT: Wizard of Wikipedia
    Recall@5
    0
    best: 82.15 (chriskuei)

Miscellaneous · 3 results

  • Intent Recognition on PhotoChat
    Precision · 2019-10-23
    58.2
    best: 63.3 (PaCE)
    SOTA
    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer · arXiv:1910.10683
  • Intent Recognition on PhotoChat
    F1 · 2019-10-23
    58.1
    best: 63.8 (PaCE)
    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer · arXiv:1910.10683
  • Intent Recognition on PhotoChat
    Recall · 2019-10-23
    57.9
    best: 68 (PaCE)
    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer · arXiv:1910.10683