Claude Instant 1.1 (few-shot, k=5)
Reported on 2 benchmarks across 2 tasks
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing2 results
- 78.9best: 87.5 (Claude 2 (few-shot, k=5))
- Accuracy85.7best: 96.4 (GPT-4 (few-shot, k=25))