Metric: Accuracy (higher is better)
| # | Model | Accuracy (%) | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | PaLM 2 (few-shot, k=3, CoT) | 84.8 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 2 | PaLM 2 (few-shot, k=3, Direct) | 78.7 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 3 | PaLM 540B (few-shot, k=3) | 78.1 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 4 | BLOOM 176B (few-shot, k=3) | 72.47 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 5 | BloombergGPT (few-shot, k=3) | 69.66 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 6 | GPT-NeoX (few-shot, k=3) | 62.36 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 7 | Chinchilla-70B (few-shot, k=5) | 58.6 | No | Training Compute-Optimal Large Language Models | 2022-03-29 | Code |
| 8 | Gopher-280B (few-shot, k=5) | 48.3 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |