Text-To-SQL on Spider 2.0

Metric: Success Rate (higher is better)

#   Model                             Success Rate (%)  Extra Data  Paper                                                Date        Code
1   Spider-Agent + o1-preview         17.03             No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -
2   Spider-Agent + GPT-4o             10.13             No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -
3   Spider-Agent + Claude-3.5-Sonnet  9.02              No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -
4   Spider-Agent + GPT-4              8.86              No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -
5   Spider-Agent + Qwen2.5-72B        6.17              No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -
6   Spider-Agent + DeepSeek-V2.5      5.22              No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -
7   Spider-Agent + Gemini-Pro-1.5     2.53              No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -
8   Spider-Agent + Llama-3.1-405B     2.21              No          Spider 2.0: Evaluating Language Models on Real-W...  2024-11-12  -