TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/TruthfulQA

Question Answering on TruthfulQA

Metric: % true (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕% true▼Extra DataPaperDate↕Code
1Vicuna 7B + Inference Time Intervention (ITI)88.6No---
2Alpaca 7B + Inference Time Intervention (ITI)66.6No---
3LLaMA 65B57NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
4UnifiedQA 3B53.86NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
5LLaMA 33B52NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
6LLaMA 13B47NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
7LLaMA 7B + Inference Time Intervention (ITI)45.1No---
8LLaMA 7B33NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
9GPT-2 1.5B29.5NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
10GPT-J 6B26.68NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
11GPT-3 175B20.44NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code