Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Question Answering on PeerQA

Metric: Prometheus-2 Answer Correctness (lower is better)

Results

| # | Model | Prometheus-2 Answer Correctness | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | GPT-3.5-Turbo-0613-16k | 3.0408 | No | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 2 | Command-R-v01-34B | 3.0571 | No | - | - | - |
| 3 | Llama-3-IT-8B-8k | 3.1102 | No | The Llama 3 Herd of Models | 2024-07-31 | Code |
| 4 | Llama-3-IT-8B-32k | 3.1673 | No | The Llama 3 Herd of Models | 2024-07-31 | Code |
| 5 | Mistral-v02-7B-32k | 3.4245 | No | Mistral 7B | 2023-10-10 | Code |
| 6 | GPT-4o-2024-08-06-128k | 3.4612 | No | GPT-4 Technical Report | 2023-03-15 | Code |