Xuefeng Bai, Yulong Chen, Yue Zhang
Abstract Meaning Representation (AMR) highlights the core semantic information of text in a graph structure. Recently, pre-trained language models (PLMs) have advanced the tasks of AMR parsing and AMR-to-text generation. However, PLMs are typically pre-trained on textual data and are thus sub-optimal for modeling structural knowledge. To this end, we investigate graph self-supervised training to improve the structure awareness of PLMs over AMR graphs. In particular, we introduce two graph auto-encoding strategies for graph-to-graph pre-training and four tasks to integrate text and graph information during pre-training. We further design a unified framework to bridge the gap between pre-training and fine-tuning tasks. Experiments on both AMR parsing and AMR-to-text generation show the superiority of our model. To our knowledge, we are the first to consider pre-training on semantic graphs.
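The graph self-supervised objective described above can be pictured as a denoising auto-encoder over linearized AMR graphs: corrupt the graph, then train a seq2seq PLM to reconstruct it. The sketch below is a minimal, hypothetical illustration of one such masked-node objective; it is not the released AMRBART code, and the toy PENMAN graph, the masking rate, and the `facebook/bart-base` checkpoint are assumptions made for the example.

```python
# Minimal sketch of a graph auto-encoding step: mask concept nodes in a
# linearized AMR graph and train a seq2seq PLM to reconstruct the full graph.
# Illustrative only -- not the authors' AMRBART implementation.
import random
import torch
from transformers import BartTokenizer, BartForConditionalGeneration

tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")

# Toy AMR graph in PENMAN notation, treated here as a plain token sequence.
amr = "( want-01 :ARG0 ( boy ) :ARG1 ( go-02 :ARG0 ( boy ) ) )"

def mask_nodes(graph: str, rate: float = 0.35) -> str:
    """Replace a random subset of concept tokens with <mask> (assumed scheme)."""
    tokens = graph.split()
    masked = [
        tokenizer.mask_token
        if tok not in "()" and not tok.startswith(":") and random.random() < rate
        else tok
        for tok in tokens
    ]
    return " ".join(masked)

# Encoder sees the corrupted graph; decoder must reconstruct the original one.
inputs = tokenizer(mask_nodes(amr), return_tensors="pt")
labels = tokenizer(amr, return_tensors="pt").input_ids

loss = model(**inputs, labels=labels).loss  # standard seq2seq denoising loss
loss.backward()                             # one pre-training step (optimizer omitted)
print(f"denoising loss: {loss.item():.3f}")
```

In the same spirit, the text-and-graph integration tasks mentioned in the abstract can be approximated by concatenating (possibly corrupted) sentence and graph sequences as the encoder input while keeping the graph as the reconstruction target.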
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| AMR Parsing | The Little Prince | Smatch | 79.8 | AMRBART large |
| AMR Parsing | LDC2017T10 | Smatch | 85.4 | AMRBART large |
| AMR Parsing | LDC2020T02 | Smatch | 84.2 | AMRBART large |
| AMR Parsing | New3 | Smatch | 76.9 | AMRBART large |
| AMR Parsing | Bio | Smatch | 63.2 | AMRBART large |