TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Methodology/Transfer Learning/BBH-nlp

Transfer Learning on BBH-nlp

Metric: Average (%) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Average (%)▼AugmentationsPaperDate↕Code
1Qwen2.5-72B86.3No---
2Jiutian-大模型86.1No---
3LLama-3-405B85.9No---
4Jiutian-57B84.07No---
5Qwen2-72B82.4No---
6LLama-3-70B81No---
7Flan-PaLM 540B (3-shot, fine-tuned, CoT + SC)78.4NoScaling Instruction-Finetuned Language Models2022-10-20Code
8PaLM 540B (CoT + self-consistency)78.2NoScaling Instruction-Finetuned Language Models2022-10-20Code
9code-davinci-002 175B (CoT)73.5NoEvaluating Large Language Models Trained on Code2021-07-07Code
10Flan-PaLM 540B (3-shot, fine-tuned, CoT)72.4NoScaling Instruction-Finetuned Language Models2022-10-20Code
11PaLM 540B (CoT)71.2NoScaling Instruction-Finetuned Language Models2022-10-20Code
12Flan-PaLM 540B (5-shot, finetuned)70NoScaling Instruction-Finetuned Language Models2022-10-20Code
13PaLM 540B62.7NoScaling Instruction-Finetuned Language Models2022-10-20Code
14Orca 2-13B50.18NoOrca 2: Teaching Small Language Models How to Re...2023-11-18-
15Orca 2-7B45.93NoOrca 2: Teaching Small Language Models How to Re...2023-11-18-