TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Improving Generalization in Language Model-Based Text-to-S...

Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques

Daking Rai, Bailin Wang, Yilun Zhou, Ziyu Yao

2023-05-27Semantic ParsingText-To-SQLDomain GeneralizationLanguage Modelling
PaperPDFCode(official)

Abstract

Compositional and domain generalization present significant challenges in semantic parsing, even for state-of-the-art semantic parsers based on pre-trained language models (LMs). In this study, we empirically investigate improving an LM's generalization in semantic parsing with two simple techniques: at the token level, we introduce a token preprocessing method to preserve the semantic boundaries of tokens produced by LM tokenizers; at the sequence level, we propose to use special tokens to mark the boundaries of components aligned between input and output. Our experimental results on two text-to-SQL semantic parsing datasets show that our token preprocessing, although simple, can substantially improve the LM performance on both types of generalization, and our component boundary marking method is particularly helpful for compositional generalization.

Results

TaskDatasetMetricValueModel
Semantic ParsingspiderExact Match Accuracy (Dev)69.4T5-3B+NatSQL+Token Preprocessing
Semantic ParsingspiderExecution Accuracy (Dev)73.7T5-3B+NatSQL+Token Preprocessing
Semantic ParsingspiderExecution Accuracy (Test)78T5-3B+NatSQL+Token Preprocessing
Text-To-SQLspiderExact Match Accuracy (Dev)69.4T5-3B+NatSQL+Token Preprocessing
Text-To-SQLspiderExecution Accuracy (Dev)73.7T5-3B+NatSQL+Token Preprocessing
Text-To-SQLspiderExecution Accuracy (Test)78T5-3B+NatSQL+Token Preprocessing

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17