TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/TruthfulQA

Question Answering on TruthfulQA

Metric: MC2 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕MC2▼Extra DataPaperDate↕Code
1Mistral-7B-Instruct-v0.2 + TruthX0.75NoTruthX: Alleviating Hallucinations by Editing La...2024-02-27Code
2LLaMa-2-7B-Chat + TruthX0.74NoTruthX: Alleviating Hallucinations by Editing La...2024-02-27Code
3GPT-2 1.5B0.39NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
4GPT-J 6B0.36NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
5UnifiedQA 3B0.35NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code
6GPT-3 175B0.33NoTruthfulQA: Measuring How Models Mimic Human Fal...2021-09-08Code