TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL w...

Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types

Ziming Guo, Chao Ma, Yinggang Sun, Tiancheng Zhao, Guangyao Wang, Hai Huang

2024-12-21Text-To-SQLNavigateMMSQL performance
PaperPDFCode(official)

Abstract

Recent advancements in large language models (LLMs) have significantly advanced text-to-SQL systems. However, most LLM-based methods often narrowly focus on SQL generation, neglecting the complexities of real-world conversational queries. This oversight can lead to unreliable responses, particularly for ambiguous questions that cannot be directly addressed with SQL. To bridge this gap, we propose MMSQL, a comprehensive test suite designed to evaluate the question classification and SQL generation capabilities of LLMs by simulating real-world scenarios with diverse question types and multi-turn Q\&A interactions. Using MMSQL, we assessed the performance of popular LLMs, including both open-source and closed-source models, and identified key factors impacting their performance in such scenarios. Moreover, we introduce an LLM-based multi-agent framework that employs specialized agents to identify question types and determine appropriate answering strategies. Our experiments demonstrate that this approach significantly enhances the model's ability to navigate the complexities of conversational dynamics, effectively handling the diverse and complex nature of user queries.

Results

TaskDatasetMetricValueModel
Semantic ParsingMMSQLTDEX67GPT-4 Turbo
Semantic ParsingMMSQLTDEX65.8Gemini-1.5 Flash
Semantic ParsingMMSQLTDEX64.1GPT-3.5 Turbo
Semantic ParsingMMSQLTDEX64Llama3-8B
Semantic ParsingMMSQLTDEX62.8Llama3-70B
Semantic ParsingMMSQLTDEX30.7SQLCoder-8B
Text-To-SQLMMSQLTDEX67GPT-4 Turbo
Text-To-SQLMMSQLTDEX65.8Gemini-1.5 Flash
Text-To-SQLMMSQLTDEX64.1GPT-3.5 Turbo
Text-To-SQLMMSQLTDEX64Llama3-8B
Text-To-SQLMMSQLTDEX62.8Llama3-70B
Text-To-SQLMMSQLTDEX30.7SQLCoder-8B

Related Papers

Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16CogDDN: A Cognitive Demand-Driven Navigation with Decision Optimization and Dual-Process Thinking2025-07-15Privacy-Preserving Multi-Stage Fall Detection Framework with Semi-supervised Federated Learning and Robotic Vision Confirmation2025-07-14Automating MD simulations for Proteins using Large language Models: NAMD-Agent2025-07-10CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation2025-07-08Graph Learning2025-07-08XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL2025-07-07Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions2025-07-06