TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PICARD: Parsing Incrementally for Constrained Auto-Regress...

PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models

Torsten Scholak, Nathan Schucher, Dzmitry Bahdanau

2021-09-10EMNLP 2021 11Semantic ParsingText-To-SQLDialogue State TrackingTranslation
PaperPDFCodeCodeCode(official)

Abstract

Large pre-trained language models for textual data have an unconstrained output space; at each decoding step, they can produce any of 10,000s of sub-word tokens. When fine-tuned to target constrained formal languages like SQL, these models often generate invalid code, rendering it unusable. We propose PICARD (code and trained models available at https://github.com/ElementAI/picard), a method for constraining auto-regressive decoders of language models through incremental parsing. PICARD helps to find valid output sequences by rejecting inadmissible tokens at each decoding step. On the challenging Spider and CoSQL text-to-SQL translation tasks, we show that PICARD transforms fine-tuned T5 models with passable performance into state-of-the-art solutions.

Results

TaskDatasetMetricValueModel
DialogueCoSQLinteraction match accuracy23.7T5-3B + PICARD
DialogueCoSQLquestion match accuracy54.6T5-3B + PICARD
Semantic ParsingspiderAccuracy71.9T5-3B + PICARD
Semantic ParsingSPIDERExact Match Accuracy (in Dev)75.5T5-3B+PICARD
Semantic ParsingSPIDERExecution Accuracy (in Dev)79.3T5-3B+PICARD
Semantic ParsingSPIDERExact Match Accuracy (in Dev)71.5T5-3B
Semantic ParsingSPIDERExecution Accuracy (in Dev)74.4T5-3B
Semantic ParsingspiderExact Match Accuracy (Dev)75.5T5-3B + PICARD
Semantic ParsingspiderExact Match Accuracy (Test)71.9T5-3B + PICARD
Semantic ParsingspiderExecution Accuracy (Test)75.1T5-3B + PICARD
Text-To-SQLSPIDERExact Match Accuracy (in Dev)75.5T5-3B+PICARD
Text-To-SQLSPIDERExecution Accuracy (in Dev)79.3T5-3B+PICARD
Text-To-SQLSPIDERExact Match Accuracy (in Dev)71.5T5-3B
Text-To-SQLSPIDERExecution Accuracy (in Dev)74.4T5-3B
Text-To-SQLspiderExact Match Accuracy (Dev)75.5T5-3B + PICARD
Text-To-SQLspiderExact Match Accuracy (Test)71.9T5-3B + PICARD
Text-To-SQLspiderExecution Accuracy (Test)75.1T5-3B + PICARD

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Function-to-Style Guidance of LLMs for Code Translation2025-07-15Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings2025-07-09CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation2025-07-08Unconditional Diffusion for Generative Sequential Recommendation2025-07-08XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL2025-07-07GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation2025-07-04