Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques

Daking Rai, Bailin Wang, Yilun Zhou, Ziyu Yao

2023-05-27
Semantic Parsing, Text-To-SQL, Domain Generalization, Language Modelling

Abstract

Compositional and domain generalization present significant challenges in semantic parsing, even for state-of-the-art semantic parsers based on pre-trained language models (LMs). In this study, we empirically investigate improving an LM's generalization in semantic parsing with two simple techniques: at the token level, we introduce a token preprocessing method to preserve the semantic boundaries of tokens produced by LM tokenizers; at the sequence level, we propose to use special tokens to mark the boundaries of components aligned between input and output. Our experimental results on two text-to-SQL semantic parsing datasets show that our token preprocessing, although simple, can substantially improve the LM performance on both types of generalization, and our component boundary marking method is particularly helpful for compositional generalization.
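The two techniques named in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names `split_identifier` and `mark_components`, the regular expression, and the `[sepN] ... [/sepN]` marker format are all assumptions made for illustration. The first function separates snake_case and dot-notation identifiers so that a subword tokenizer is less likely to merge pieces across a semantic boundary; the second wraps aligned input and output components in matching marker pairs.

```python
import re


def split_identifier(text: str) -> str:
    """Token-level sketch (hypothetical rule): put spaces around '_' and '.'
    when they join an alphanumeric character to a following letter, so
    "pet_age" becomes "pet _ age" and "T1.id" becomes "T1 . id".
    Decimal numbers such as "3.5" are left untouched."""
    return re.sub(r"(?<=[A-Za-z0-9])([_.])(?=[A-Za-z])", r" \1 ", text)


def mark_components(question_parts, sql_parts):
    """Sequence-level sketch (hypothetical marker format): wrap the i-th
    question component and the i-th SQL component in the same numbered
    marker pair, so aligned spans share a boundary token."""
    if len(question_parts) != len(sql_parts):
        raise ValueError("components must be aligned one-to-one")
    source = " ".join(
        f"[sep{i}] {part} [/sep{i}]" for i, part in enumerate(question_parts)
    )
    target = " ".join(
        f"[sep{i}] {part} [/sep{i}]" for i, part in enumerate(sql_parts)
    )
    return source, target
```

In a real pipeline the marker strings would presumably need to be registered with the tokenizer as special tokens, and the component alignment would have to come from annotation or a heuristic; neither step is specified in the abstract.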

Results

Task	Dataset	Metric	Value	Model
Semantic Parsing	spider	Exact Match Accuracy (Dev)	69.4	T5-3B+NatSQL+Token Preprocessing
Semantic Parsing	spider	Execution Accuracy (Dev)	73.7	T5-3B+NatSQL+Token Preprocessing
Semantic Parsing	spider	Execution Accuracy (Test)	78	T5-3B+NatSQL+Token Preprocessing
Text-To-SQL	spider	Exact Match Accuracy (Dev)	69.4	T5-3B+NatSQL+Token Preprocessing
Text-To-SQL	spider	Execution Accuracy (Dev)	73.7	T5-3B+NatSQL+Token Preprocessing
Text-To-SQL	spider	Execution Accuracy (Test)	78	T5-3B+NatSQL+Token Preprocessing
