TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Data-to-Text Generation with Content Selection and Planning

Data-to-Text Generation with Content Selection and Planning

Ratish Puduppully, Li Dong, Mirella Lapata

2018-09-03Data-to-Text GenerationText GenerationDescriptive
PaperPDFCodeCode(official)

Abstract

Recent advances in data-to-text generation have led to the use of large-scale datasets and neural network models which are trained end-to-end, without explicitly modeling what to say and in what order. In this work, we present a neural network architecture which incorporates content selection and planning without sacrificing end-to-end training. We decompose the generation task into two stages. Given a corpus of data records (paired with descriptive documents), we first generate a content plan highlighting which information should be mentioned and in which order and then generate the document while taking the content plan into account. Automatic and human-based evaluation experiments show that our model outperforms strong baselines improving the state-of-the-art on the recently released RotoWire dataset.

Results

TaskDatasetMetricValueModel
Text GenerationRotoWire (Relation Generation)count34.28Neural Content Planning + conditional copy
Text GenerationRotoWire (Content Ordering)BLEU16.5Neural Content Planning + conditional copy
Text GenerationRotoWireBLEU16.5Neural Content Planning + conditional copy
Data-to-Text GenerationRotoWire (Relation Generation)count34.28Neural Content Planning + conditional copy
Data-to-Text GenerationRotoWire (Content Ordering)BLEU16.5Neural Content Planning + conditional copy
Data-to-Text GenerationRotoWireBLEU16.5Neural Content Planning + conditional copy

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization2025-07-17Mitigating Object Hallucinations via Sentence-Level Early Intervention2025-07-16Assay2Mol: large language model-based drug design using BioAssay context2025-07-16Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs2025-07-15Seq vs Seq: An Open Suite of Paired Encoders and Decoders2025-07-15Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15