Xuefeng Bai, Yulong Chen, Yue Zhang
Abstract Meaning Representation (AMR) highlights the core semantic information of text in a graph structure. Recently, pre-trained language models (PLMs) have advanced the tasks of AMR parsing and AMR-to-text generation. However, PLMs are typically pre-trained on textual data and are thus sub-optimal for modeling structural knowledge. To this end, we investigate graph self-supervised training to improve the structure awareness of PLMs over AMR graphs. In particular, we introduce two graph auto-encoding strategies for graph-to-graph pre-training and four tasks to integrate text and graph information during pre-training. We further design a unified framework to bridge the gap between pre-training and fine-tuning tasks. Experiments on both AMR parsing and AMR-to-text generation show the superiority of our model. To our knowledge, we are the first to consider pre-training on semantic graphs.
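The graph self-supervised objective described above can be pictured as a denoising auto-encoder over linearized AMR graphs: corrupt the graph, then train a seq2seq PLM to reconstruct it. The sketch below is a minimal, hypothetical illustration of one such masked-node objective; it is not the released AMRBART code, and the toy PENMAN graph, the masking rate, and the `facebook/bart-base` checkpoint are assumptions made for the example.

```python
# Minimal sketch of a graph auto-encoding step: mask concept nodes in a
# linearized AMR graph and train a seq2seq PLM to reconstruct the full graph.
# Illustrative only -- not the authors' AMRBART implementation.
import random
import torch
from transformers import BartTokenizer, BartForConditionalGeneration

tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")

# Toy AMR graph in PENMAN notation, treated here as a plain token sequence.
amr = "( want-01 :ARG0 ( boy ) :ARG1 ( go-02 :ARG0 ( boy ) ) )"

def mask_nodes(graph: str, rate: float = 0.35) -> str:
    """Replace a random subset of concept tokens with <mask> (assumed scheme)."""
    tokens = graph.split()
    masked = [
        tokenizer.mask_token
        if tok not in "()" and not tok.startswith(":") and random.random() < rate
        else tok
        for tok in tokens
    ]
    return " ".join(masked)

# Encoder sees the corrupted graph; decoder must reconstruct the original one.
inputs = tokenizer(mask_nodes(amr), return_tensors="pt")
labels = tokenizer(amr, return_tensors="pt").input_ids

loss = model(**inputs, labels=labels).loss  # standard seq2seq denoising loss
loss.backward()                             # one pre-training step (optimizer omitted)
print(f"denoising loss: {loss.item():.3f}")
```

In the same spirit, the text-and-graph integration tasks mentioned in the abstract can be approximated by concatenating (possibly corrupted) sentence and graph sequences as the encoder input while keeping the graph as the reconstruction target.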
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| AMR Parsing | The Little Prince | Smatch | 79.8 | AMRBART large |
| AMR Parsing | LDC2017T10 | Smatch | 85.4 | AMRBART large |
| AMR Parsing | LDC2020T02 | Smatch | 84.2 | AMRBART large |
| AMR Parsing | New3 | Smatch | 76.9 | AMRBART large |
| AMR Parsing | Bio | Smatch | 63.2 | AMRBART large |