TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Data...

WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset

Luyu Wang, Yujia Li, Ozlem Aslan, Oriol Vinyals

2021-07-20NAACL (TextGraphs) 2021 6KG-to-Text GenerationText GenerationGraph Representation LearningRepresentation LearningText RetrievalRetrievalConditional Text GenerationGraph Generation
PaperPDFCode(official)

Abstract

We present a new dataset of Wikipedia articles each paired with a knowledge graph, to facilitate the research in conditional text generation, graph generation and graph representation learning. Existing graph-text paired datasets typically contain small graphs and short text (1 or few sentences), thus limiting the capabilities of the models that can be learned on the data. Our new dataset WikiGraphs is collected by pairing each Wikipedia article from the established WikiText-103 benchmark (Merity et al., 2016) with a subgraph from the Freebase knowledge graph (Bollacker et al., 2008). This makes it easy to benchmark against other state-of-the-art text generative models that are capable of generating long paragraphs of coherent text. Both the graphs and the text data are of significantly larger scale compared to prior graph-text paired datasets. We present baseline graph neural network and transformer model results on our dataset for 3 tasks: graph -> text generation, graph -> text retrieval and text -> graph retrieval. We show that better conditioning on the graph provides gains in generation and retrieval quality but there is still large room for improvement.

Results

TaskDatasetMetricValueModel
Text GenerationWikiGraphsTest perplexity25.85Unconditional
Text GenerationWikiGraphsrBLEU (Test)9.98Unconditional
Text GenerationWikiGraphsrBLEU (Valid)10.97Unconditional
Text GenerationWikiGraphsrBLEU(w/title)(Test)24.07Unconditional
Text GenerationWikiGraphsrBLEU(w/title)(Valid)27.98Unconditional
Text GenerationWikiGraphsTest perplexity26.65BoW
Text GenerationWikiGraphsrBLEU (Test)24.41BoW
Text GenerationWikiGraphsrBLEU (Valid)29.53BoW
Text GenerationWikiGraphsrBLEU(w/title)(Test)27.39BoW
Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.41BoW
Text GenerationWikiGraphsTest perplexity26.93GNN
Text GenerationWikiGraphsrBLEU (Test)26.22GNN
Text GenerationWikiGraphsrBLEU (Valid)31.39GNN
Text GenerationWikiGraphsrBLEU(w/title)(Test)28.35GNN
Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.65GNN
Text GenerationWikiGraphsTest perplexity27.4Nodes
Text GenerationWikiGraphsrBLEU (Test)25.31Nodes
Text GenerationWikiGraphsrBLEU (Valid)30.51Nodes
Text GenerationWikiGraphsrBLEU(w/title)(Test)27.43Nodes
Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.6Nodes
Data-to-Text GenerationWikiGraphsTest perplexity25.85Unconditional
Data-to-Text GenerationWikiGraphsrBLEU (Test)9.98Unconditional
Data-to-Text GenerationWikiGraphsrBLEU (Valid)10.97Unconditional
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)24.07Unconditional
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)27.98Unconditional
Data-to-Text GenerationWikiGraphsTest perplexity26.65BoW
Data-to-Text GenerationWikiGraphsrBLEU (Test)24.41BoW
Data-to-Text GenerationWikiGraphsrBLEU (Valid)29.53BoW
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)27.39BoW
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.41BoW
Data-to-Text GenerationWikiGraphsTest perplexity26.93GNN
Data-to-Text GenerationWikiGraphsrBLEU (Test)26.22GNN
Data-to-Text GenerationWikiGraphsrBLEU (Valid)31.39GNN
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)28.35GNN
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.65GNN
Data-to-Text GenerationWikiGraphsTest perplexity27.4Nodes
Data-to-Text GenerationWikiGraphsrBLEU (Test)25.31Nodes
Data-to-Text GenerationWikiGraphsrBLEU (Valid)30.51Nodes
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)27.43Nodes
Data-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.6Nodes
KG-to-Text GenerationWikiGraphsTest perplexity25.85Unconditional
KG-to-Text GenerationWikiGraphsrBLEU (Test)9.98Unconditional
KG-to-Text GenerationWikiGraphsrBLEU (Valid)10.97Unconditional
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)24.07Unconditional
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)27.98Unconditional
KG-to-Text GenerationWikiGraphsTest perplexity26.65BoW
KG-to-Text GenerationWikiGraphsrBLEU (Test)24.41BoW
KG-to-Text GenerationWikiGraphsrBLEU (Valid)29.53BoW
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)27.39BoW
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.41BoW
KG-to-Text GenerationWikiGraphsTest perplexity26.93GNN
KG-to-Text GenerationWikiGraphsrBLEU (Test)26.22GNN
KG-to-Text GenerationWikiGraphsrBLEU (Valid)31.39GNN
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)28.35GNN
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.65GNN
KG-to-Text GenerationWikiGraphsTest perplexity27.4Nodes
KG-to-Text GenerationWikiGraphsrBLEU (Test)25.31Nodes
KG-to-Text GenerationWikiGraphsrBLEU (Valid)30.51Nodes
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Test)27.43Nodes
KG-to-Text GenerationWikiGraphsrBLEU(w/title)(Valid)32.6Nodes

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Making Language Model a Hierarchical Classifier and Generator2025-07-17SMART: Relation-Aware Learning of Geometric Representations for Knowledge Graphs2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17