Question Answering on QuALITY
Metric: Accuracy (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | Accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Claude 1.3 (5-shot) | 84.1 | No | - | - | - |
| 2 | Claude 2 (5-shot) | 83.2 | No | - | - | - |
| 3 | RAPTOR + GPT-4 (June 2023) | 82.6 | No | RAPTOR: Recursive Abstractive Processing for Tre... | 2024-01-31 | Code |
| 4 | Claude Instant 1.1 (5-shot) | 80.5 | No | - | - | - |