TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/CoT-T5-11B (1024 Shot)

CoT-T5-11B (1024 Shot)

Reported on 7 benchmarks across 3 tasks · 1 paper · 6 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Methodology6 results

  • Few-Shot LearningonPubMedQA
    Accuracy· 2023-05-23
    73.42
    best: 77.9 (MetaGen Blended RAG (zero-shot))
    SOTA
    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningarXiv:2305.14045
  • Few-Shot LearningonCaseHOLD
    Accuracy· 2023-05-23
    68.3
    SOTA
    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningarXiv:2305.14045
  • Few-Shot LearningonMedNLI
    Accuracy· 2023-05-23
    78.02
    SOTA
    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningarXiv:2305.14045
  • Meta-LearningonPubMedQA
    Accuracy· 2023-05-23
    73.42
    best: 77.9 (MetaGen Blended RAG (zero-shot))
    SOTA
    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningarXiv:2305.14045
  • Meta-LearningonCaseHOLD
    Accuracy· 2023-05-23
    68.3
    SOTA
    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningarXiv:2305.14045
  • Meta-LearningonMedNLI
    Accuracy· 2023-05-23
    78.02
    SOTA
    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningarXiv:2305.14045

Natural Language Processing1 result

  • Question AnsweringonPubMedQA
    Accuracy· 2023-05-23
    73.42
    best: 81.6 (Meditron-70B (CoT + SC))
    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningarXiv:2305.14045