Legal Reasoning on LegalBench (Issue-spotting)

Metric: Balanced Accuracy (higher is better)

LeaderboardDataset
#ModelBalanced AccuracyExtra DataPaperDateCode
1GPT-482.9No---
2GPT-3.560.9No---
3Claude-158.1No---