Fact Checking on AVeriTeC

Metric: Question Only score (higher is better)

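| # | Model | Question Only score | Extra Data | Paper | Date | Code |
|---|---------|------|----|-------------------------------------------------------|------------|------|
| 1 | HerO | 0.48 | No | HerO at AVeriTeC: The Herd of Open Large Languag... | 2024-10-16 | Code |
| 2 | CTU AIC | 0.46 | No | AIC CTU system at AVeriTeC: Re-framing automated... | 2024-10-15 | Code |
| 3 | InFact | 0.45 | No | - | - | - |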