Pei Ke, Haozhe Ji, Yu Ran, Xin Cui, LiWei Wang, Linfeng Song, Xiaoyan Zhu, Minlie Huang
Existing pre-trained models for knowledge-graph-to-text (KG-to-text) generation simply fine-tune text-to-text pre-trained models such as BART or T5 on KG-to-text datasets, which largely ignore the graph structure during encoding and lack elaborate pre-training tasks to explicitly model graph-text alignments. To tackle these problems, we propose a graph-text joint representation learning model called JointGT. During encoding, we devise a structure-aware semantic aggregation module which is plugged into each Transformer layer to preserve the graph structure. Furthermore, we propose three new pre-training tasks to explicitly enhance the graph-text alignment including respective text / graph reconstruction, and graph-text alignment in the embedding space via Optimal Transport. Experiments show that JointGT obtains new state-of-the-art performance on various KG-to-text datasets.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Text Generation | WebNLG 2.0 (Constrained) | BLEU | 61.01 | JointGT (T5) |
| Text Generation | WebNLG 2.0 (Constrained) | METEOR | 46.32 | JointGT (T5) |
| Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 73.57 | JointGT (T5) |
| Text Generation | WebNLG 2.0 (Constrained) | BLEU | 58.66 | T5 |
| Text Generation | WebNLG 2.0 (Constrained) | METEOR | 46.04 | T5 |
| Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 73.06 | T5 |
| Text Generation | WebNLG 2.0 (Constrained) | BLEU | 58.55 | JointGT (BART) |
| Text Generation | WebNLG 2.0 (Constrained) | METEOR | 45.01 | JointGT (BART) |
| Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 72.31 | JointGT (BART) |
| Text Generation | WebNLG 2.0 (Constrained) | BLEU | 56.65 | BART |
| Text Generation | WebNLG 2.0 (Constrained) | METEOR | 44.51 | BART |
| Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 70.94 | BART |
| Text Generation | WebQuestions | BLEU | 30.02 | JointGT (BART) |
| Text Generation | WebQuestions | METEOR | 32.05 | JointGT (BART) |
| Text Generation | WebQuestions | ROUGE | 55.6 | JointGT (BART) |
| Text Generation | WebQuestions | BLEU | 29.61 | BART |
| Text Generation | WebQuestions | METEOR | 31.48 | BART |
| Text Generation | WebQuestions | ROUGE | 55.42 | BART |
| Text Generation | WebQuestions | BLEU | 28.95 | JointGT (T5) |
| Text Generation | WebQuestions | METEOR | 31.29 | JointGT (T5) |
| Text Generation | WebQuestions | ROUGE | 54.47 | JointGT (T5) |
| Text Generation | WebQuestions | BLEU | 28.78 | T5 |
| Text Generation | WebQuestions | METEOR | 30.55 | T5 |
| Text Generation | WebQuestions | ROUGE | 55.12 | T5 |
| Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 66.14 | JointGT (T5) |
| Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 47.25 | JointGT (T5) |
| Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 75.91 | JointGT (T5) |
| Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 65.92 | JointGT (BART) |
| Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 47.15 | JointGT (BART) |
| Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 76.1 | JointGT (BART) |
| Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 64.55 | BART |
| Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 46.51 | BART |
| Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 75.13 | BART |
| Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 64.42 | T5 |
| Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 46.58 | T5 |
| Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 74.77 | T5 |
| Text Generation | PathQuestion | BLEU | 65.89 | JointGT (BART) |
| Text Generation | PathQuestion | METEOR | 48.25 | JointGT (BART) |
| Text Generation | PathQuestion | ROUGE | 78.87 | JointGT (BART) |
| Text Generation | PathQuestion | BLEU | 63.74 | BART |
| Text Generation | PathQuestion | METEOR | 47.23 | BART |
| Text Generation | PathQuestion | ROUGE | 77.76 | BART |
| Text Generation | PathQuestion | BLEU | 60.45 | JointGT (T5) |
| Text Generation | PathQuestion | METEOR | 45.38 | JointGT (T5) |
| Text Generation | PathQuestion | ROUGE | 77.59 | JointGT (T5) |
| Text Generation | PathQuestion | BLEU | 58.95 | T5 |
| Text Generation | PathQuestion | METEOR | 44.72 | T5 |
| Text Generation | PathQuestion | ROUGE | 76.58 | T5 |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 61.01 | JointGT (T5) |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 46.32 | JointGT (T5) |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 73.57 | JointGT (T5) |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 58.66 | T5 |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 46.04 | T5 |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 73.06 | T5 |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 58.55 | JointGT (BART) |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 45.01 | JointGT (BART) |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 72.31 | JointGT (BART) |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 56.65 | BART |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 44.51 | BART |
| Data-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 70.94 | BART |
| Data-to-Text Generation | WebQuestions | BLEU | 30.02 | JointGT (BART) |
| Data-to-Text Generation | WebQuestions | METEOR | 32.05 | JointGT (BART) |
| Data-to-Text Generation | WebQuestions | ROUGE | 55.6 | JointGT (BART) |
| Data-to-Text Generation | WebQuestions | BLEU | 29.61 | BART |
| Data-to-Text Generation | WebQuestions | METEOR | 31.48 | BART |
| Data-to-Text Generation | WebQuestions | ROUGE | 55.42 | BART |
| Data-to-Text Generation | WebQuestions | BLEU | 28.95 | JointGT (T5) |
| Data-to-Text Generation | WebQuestions | METEOR | 31.29 | JointGT (T5) |
| Data-to-Text Generation | WebQuestions | ROUGE | 54.47 | JointGT (T5) |
| Data-to-Text Generation | WebQuestions | BLEU | 28.78 | T5 |
| Data-to-Text Generation | WebQuestions | METEOR | 30.55 | T5 |
| Data-to-Text Generation | WebQuestions | ROUGE | 55.12 | T5 |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 66.14 | JointGT (T5) |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 47.25 | JointGT (T5) |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 75.91 | JointGT (T5) |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 65.92 | JointGT (BART) |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 47.15 | JointGT (BART) |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 76.1 | JointGT (BART) |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 64.55 | BART |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 46.51 | BART |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 75.13 | BART |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 64.42 | T5 |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 46.58 | T5 |
| Data-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 74.77 | T5 |
| Data-to-Text Generation | PathQuestion | BLEU | 65.89 | JointGT (BART) |
| Data-to-Text Generation | PathQuestion | METEOR | 48.25 | JointGT (BART) |
| Data-to-Text Generation | PathQuestion | ROUGE | 78.87 | JointGT (BART) |
| Data-to-Text Generation | PathQuestion | BLEU | 63.74 | BART |
| Data-to-Text Generation | PathQuestion | METEOR | 47.23 | BART |
| Data-to-Text Generation | PathQuestion | ROUGE | 77.76 | BART |
| Data-to-Text Generation | PathQuestion | BLEU | 60.45 | JointGT (T5) |
| Data-to-Text Generation | PathQuestion | METEOR | 45.38 | JointGT (T5) |
| Data-to-Text Generation | PathQuestion | ROUGE | 77.59 | JointGT (T5) |
| Data-to-Text Generation | PathQuestion | BLEU | 58.95 | T5 |
| Data-to-Text Generation | PathQuestion | METEOR | 44.72 | T5 |
| Data-to-Text Generation | PathQuestion | ROUGE | 76.58 | T5 |
| Question Generation | GrailQA-Zero-Shot | FactSpotter | 94.15 | JointGT |
| Question Generation | GrailQA-Zero-Shot | METEOR | 37.69 | JointGT |
| Question Generation | GrailQA-Zero-Shot | bleu | 32.94 | JointGT |
| Question Generation | GrailQA-Compositional | BLEU | 31.46 | JointGT |
| Question Generation | GrailQA-Compositional | FactSpotter | 95.26 | JointGT |
| Question Generation | GrailQA-Compositional | METEOR | 36.08 | JointGT |
| Question Generation | GrailQA-IID | BLEU | 45.95 | JointGT |
| Question Generation | GrailQA-IID | FactSpotter | 98.62 | JointGT |
| Question Generation | GrailQA-IID | METEOR | 41.65 | JointGT |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 61.01 | JointGT (T5) |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 46.32 | JointGT (T5) |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 73.57 | JointGT (T5) |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 58.66 | T5 |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 46.04 | T5 |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 73.06 | T5 |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 58.55 | JointGT (BART) |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 45.01 | JointGT (BART) |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 72.31 | JointGT (BART) |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | BLEU | 56.65 | BART |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | METEOR | 44.51 | BART |
| KG-to-Text Generation | WebNLG 2.0 (Constrained) | ROUGE | 70.94 | BART |
| KG-to-Text Generation | WebQuestions | BLEU | 30.02 | JointGT (BART) |
| KG-to-Text Generation | WebQuestions | METEOR | 32.05 | JointGT (BART) |
| KG-to-Text Generation | WebQuestions | ROUGE | 55.6 | JointGT (BART) |
| KG-to-Text Generation | WebQuestions | BLEU | 29.61 | BART |
| KG-to-Text Generation | WebQuestions | METEOR | 31.48 | BART |
| KG-to-Text Generation | WebQuestions | ROUGE | 55.42 | BART |
| KG-to-Text Generation | WebQuestions | BLEU | 28.95 | JointGT (T5) |
| KG-to-Text Generation | WebQuestions | METEOR | 31.29 | JointGT (T5) |
| KG-to-Text Generation | WebQuestions | ROUGE | 54.47 | JointGT (T5) |
| KG-to-Text Generation | WebQuestions | BLEU | 28.78 | T5 |
| KG-to-Text Generation | WebQuestions | METEOR | 30.55 | T5 |
| KG-to-Text Generation | WebQuestions | ROUGE | 55.12 | T5 |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 66.14 | JointGT (T5) |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 47.25 | JointGT (T5) |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 75.91 | JointGT (T5) |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 65.92 | JointGT (BART) |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 47.15 | JointGT (BART) |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 76.1 | JointGT (BART) |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 64.55 | BART |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 46.51 | BART |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 75.13 | BART |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | BLEU | 64.42 | T5 |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | METEOR | 46.58 | T5 |
| KG-to-Text Generation | WebNLG 2.0 (Unconstrained) | ROUGE | 74.77 | T5 |
| KG-to-Text Generation | PathQuestion | BLEU | 65.89 | JointGT (BART) |
| KG-to-Text Generation | PathQuestion | METEOR | 48.25 | JointGT (BART) |
| KG-to-Text Generation | PathQuestion | ROUGE | 78.87 | JointGT (BART) |
| KG-to-Text Generation | PathQuestion | BLEU | 63.74 | BART |
| KG-to-Text Generation | PathQuestion | METEOR | 47.23 | BART |
| KG-to-Text Generation | PathQuestion | ROUGE | 77.76 | BART |
| KG-to-Text Generation | PathQuestion | BLEU | 60.45 | JointGT (T5) |
| KG-to-Text Generation | PathQuestion | METEOR | 45.38 | JointGT (T5) |
| KG-to-Text Generation | PathQuestion | ROUGE | 77.59 | JointGT (T5) |
| KG-to-Text Generation | PathQuestion | BLEU | 58.95 | T5 |
| KG-to-Text Generation | PathQuestion | METEOR | 44.72 | T5 |
| KG-to-Text Generation | PathQuestion | ROUGE | 76.58 | T5 |