TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/TruthfulQA

Question Answering on TruthfulQA

Metric: % info (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕% info▼Extra DataPaperDate↕Code
1Alpaca 7B + Inference Time Intervention (ITI)97.7No---
2GPT-3 175B97.55NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
3LLaMA 7B + Inference Time Intervention (ITI)93.8No---
4GPT-J 6B89.96NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
5GPT-2 1.5B89.84NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
6Vicuna 7B + Inference Time Intervention (ITI)83.5No---
7UnifiedQA 3B64.5NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
8LLaMA 65B53NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
9LLaMA 33B48NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
10LLaMA 13B41NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
11LLaMA 7B29NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code