EventNarrative: A large-scale Event-centric Dataset for Knowledge Graph-to-Text Generation

Anthony Colas, Ali Sadeghian, Yue Wang, Daisy Zhe Wang

2021-10-30

KG-to-Text Generation, Knowledge Graphs, Text Generation, World Knowledge

Abstract

We introduce EventNarrative, a knowledge graph-to-text dataset built from publicly available open-world knowledge graphs. Given the recent advances in event-driven Information Extraction (IE), and given that prior research on graph-to-text has focused only on entity-driven KGs, this paper focuses on event-centric data. However, our data generation system can still be adapted to other types of KG data. Existing large-scale datasets in the graph-to-text area are non-parallel, meaning there is a large disconnect between the KGs and the text. The datasets that do pair a KG with text are small-scale and either manually generated or generated without a rich ontology, making the corresponding graphs sparse. Furthermore, these datasets contain many unlinked entities between their KG and text pairs. EventNarrative consists of approximately 230,000 graphs and their corresponding natural language text, 6 times larger than the current largest parallel dataset. It makes use of a rich ontology, all of the KGs' entities are linked to the text, and our manual annotations confirm high data quality. Our aim is two-fold: to help break new ground in event-centric research, where data is lacking, and to give researchers a well-defined, large-scale dataset with which to better evaluate existing and future knowledge graph-to-text models. We also evaluate two types of baselines on EventNarrative: a graph-to-text-specific model and two state-of-the-art language models, which previous work has shown to be adaptable to the knowledge graph-to-text domain.

Results

Task	Dataset	Metric	Value	Model
Text Generation	EventNarrative	CIDEr	3.31	BART
Text Generation	EventNarrative	ChrF++	64.71	BART
Text Generation	EventNarrative	BLEU	30.78	GraphWriter
Text Generation	EventNarrative	BertScore	92.12	GraphWriter
Text Generation	EventNarrative	CIDEr	4.59	GraphWriter
Text Generation	EventNarrative	ChrF++	47.91	GraphWriter
Text Generation	EventNarrative	METEOR	27.72	GraphWriter
Text Generation	EventNarrative	ROUGE	71.92	GraphWriter
Text Generation	EventNarrative	CIDEr	3	T5
Text Generation	EventNarrative	ChrF++	56.76	T5
Data-to-Text Generation	EventNarrative	CIDEr	3.31	BART
Data-to-Text Generation	EventNarrative	ChrF++	64.71	BART
Data-to-Text Generation	EventNarrative	BLEU	30.78	GraphWriter
Data-to-Text Generation	EventNarrative	BertScore	92.12	GraphWriter
Data-to-Text Generation	EventNarrative	CIDEr	4.59	GraphWriter
Data-to-Text Generation	EventNarrative	ChrF++	47.91	GraphWriter
Data-to-Text Generation	EventNarrative	METEOR	27.72	GraphWriter
Data-to-Text Generation	EventNarrative	ROUGE	71.92	GraphWriter
Data-to-Text Generation	EventNarrative	CIDEr	3	T5
Data-to-Text Generation	EventNarrative	ChrF++	56.76	T5
KG-to-Text Generation	EventNarrative	CIDEr	3.31	BART
KG-to-Text Generation	EventNarrative	ChrF++	64.71	BART
KG-to-Text Generation	EventNarrative	BLEU	30.78	GraphWriter
KG-to-Text Generation	EventNarrative	BertScore	92.12	GraphWriter
KG-to-Text Generation	EventNarrative	CIDEr	4.59	GraphWriter
KG-to-Text Generation	EventNarrative	ChrF++	47.91	GraphWriter
KG-to-Text Generation	EventNarrative	METEOR	27.72	GraphWriter
KG-to-Text Generation	EventNarrative	ROUGE	71.92	GraphWriter
KG-to-Text Generation	EventNarrative	CIDEr	3	T5
KG-to-Text Generation	EventNarrative	ChrF++	56.76	T5
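
The table lists the same ten scores under each of the three task labels, so a per-metric view can make the model comparison easier to read. The snippet below is a minimal sketch of that regrouping, not part of the paper: the names `ROWS`, `by_metric`, and `best_model` are hypothetical, the values are copied from the rows above, and picking a "best" model assumes that higher is better for every metric listed.

```python
from collections import defaultdict

# (metric, value, model) triples copied from the results table above.
# The table repeats these rows under three task labels; one copy is enough here.
ROWS = [
    ("CIDEr", 3.31, "BART"),
    ("ChrF++", 64.71, "BART"),
    ("BLEU", 30.78, "GraphWriter"),
    ("BertScore", 92.12, "GraphWriter"),
    ("CIDEr", 4.59, "GraphWriter"),
    ("ChrF++", 47.91, "GraphWriter"),
    ("METEOR", 27.72, "GraphWriter"),
    ("ROUGE", 71.92, "GraphWriter"),
    ("CIDEr", 3.0, "T5"),
    ("ChrF++", 56.76, "T5"),
]


def by_metric(rows):
    """Group scores as {metric: {model: value}}."""
    table = defaultdict(dict)
    for metric, value, model in rows:
        table[metric][model] = value
    return dict(table)


def best_model(scores):
    """Return the model with the highest score (assumes higher is better)."""
    return max(scores, key=scores.get)


if __name__ == "__main__":
    for metric, scores in by_metric(ROWS).items():
        # Only metrics reported for more than one model allow a comparison.
        if len(scores) > 1:
            print(f"{metric}: best = {best_model(scores)} {scores}")
```

Only CIDEr and ChrF++ are reported for all three models in the table, so those are the only metrics this regrouping can compare across models.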
