TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Text-To-SQL/BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)

Text-To-SQL on BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation)

Metric: Execution Accuracy % (Dev) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Execution Accuracy % (Dev)▼Extra DataPaperDate↕Code
1DSAIR + GPT-4o74.32No---
2XiYan-SQL73.34NoA Preview of XiYan-SQL: A Multi-Generator Ensemb...2024-11-13Code
3CHASE-SQL + Gemini73.14NoCHASE-SQL: Multi-Path Reasoning and Preference O...2024-10-02-
4ExSL + granite-34b-code72.43No---
5Insights AI72.16No---
6OpenSearch-SQL+ v2 + GPT-4o69.3No---
7MCTS-SQL68.91No---
8PURPLE + RED + GPT-4o68.12No---
9Arcwise + GPT-4o67.99No---
10Distillery + GPT-4o67.21NoThe Death of Schema Linking? Text-to-SQL in the ...2024-08-14-
11RECAP + Gemini66.95No---
12MSL-SQL + DeepSeek-V2.566.82No---
13MSc-SQL65.6NoMSc-SQL: Multi-Sample Critiquing Small Language ...2024-10-16Code
14ByteBrain65.45No---
15ExSL + granite-20b-code65.38No---
16CHESS65NoCHESS: Contextual Harnessing for Efficient SQL S...2024-05-27Code
17SCL-SQL64.73No---
18SFT CodeS-15B + SQLFixAgent64.62No---
19MCS-SQL + GPT-463.36No---
20PURPLE + GPT-4o62.97No---
21GRA-SQL62.58No---
22OpenSearch-SQL v1 + GPT-461.34No---
23PB-SQL v160.5No---
24Dubo-SQL, v159.71No---
25SuperSQL58.5No---
26SFT CodeS-15B58.47No---
27MAC-SQL + GPT-457.56NoMAC-SQL: A Multi-Agent Collaborative Framework f...2023-12-18Code
28SFT CodeS-7B57.17No---
29SENSE-13B55.48No---
30SENSE55.48No---
31DAIL-SQL + GPT-454.76NoText-to-SQL Empowered by Large Language Models: ...2023-08-29Code
32DIN-SQL + GPT-450.72NoDIN-SQL: Decomposed In-Context Learning of Text-...2023-04-21Code
33DELLM + MAC-SQL48.92NoKnowledge-to-SQL: Enhancing SQL Generation with ...2024-02-18Code
34GPT-4 (Baseline)46.35NoCan LLMs Effectively Leverage Graph Structural I...2023-09-28Code
35Claude-2 (Baseline)42.7NoCan LLMs Effectively Leverage Graph Structural I...2023-09-28Code
36Open SQL-7B37.68No---
37ChatGPT (Baseline)37.22NoCan LLM Already Serve as A Database Interface? A...2023-05-04Code
38CoT + ChatGPT36.64NoCan LLM Already Serve as A Database Interface? A...2023-05-04Code
39Codex (Baseline)34.35NoCan LLM Already Serve as A Database Interface? A...2023-05-04Code
40Palm-2 (Baseline)27.38NoCan LLM Already Serve as A Database Interface? A...2023-05-04Code