TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Data-to-text Generation with Macro Planning

Data-to-text Generation with Macro Planning

Ratish Puduppully, Mirella Lapata

2021-02-04Data-to-Text GenerationText Generation
PaperPDFCode(official)

Abstract

Recent approaches to data-to-text generation have adopted the very successful encoder-decoder architecture or variants thereof. These models generate text which is fluent (but often imprecise) and perform quite poorly at selecting appropriate content and ordering it coherently. To overcome some of these issues, we propose a neural model with a macro planning stage followed by a generation stage reminiscent of traditional methods which embrace separate modules for planning and surface realization. Macro plans represent high level organization of important content such as entities, events and their interactions; they are learnt from data and given as input to the generator. Extensive experiments on two data-to-text benchmarks (RotoWire and MLB) show that our approach outperforms competitive baselines in terms of automatic and human evaluation.

Results

TaskDatasetMetricValueModel
Text GenerationRotoWire (Relation Generation)Precision97.6Macro
Text GenerationRotoWire (Relation Generation)count42.1Macro
Text GenerationMLB Dataset (Content Selection)Precision40.8Macro
Text GenerationMLB Dataset (Content Selection)Recall54.9Macro
Text GenerationMLB Dataset (Content Ordering)DLD21.8Macro
Text GenerationMLB Dataset (Content Ordering)DLD20.7ENT
Text GenerationMLB DatasetBLEU12.62Macro
Text GenerationRotoWireBLEU15.46Macro
Text GenerationMLB Dataset (Relation Generation)Precision94.4Macro
Text GenerationMLB Dataset (Relation Generation)count30.8Macro
Text GenerationMLB Dataset (Relation Generation)Precision81.1ENT
Text GenerationMLB Dataset (Relation Generation)count23.8ENT
Data-to-Text GenerationRotoWire (Relation Generation)Precision97.6Macro
Data-to-Text GenerationRotoWire (Relation Generation)count42.1Macro
Data-to-Text GenerationMLB Dataset (Content Selection)Precision40.8Macro
Data-to-Text GenerationMLB Dataset (Content Selection)Recall54.9Macro
Data-to-Text GenerationMLB Dataset (Content Ordering)DLD21.8Macro
Data-to-Text GenerationMLB Dataset (Content Ordering)DLD20.7ENT
Data-to-Text GenerationMLB DatasetBLEU12.62Macro
Data-to-Text GenerationRotoWireBLEU15.46Macro
Data-to-Text GenerationMLB Dataset (Relation Generation)Precision94.4Macro
Data-to-Text GenerationMLB Dataset (Relation Generation)count30.8Macro
Data-to-Text GenerationMLB Dataset (Relation Generation)Precision81.1ENT
Data-to-Text GenerationMLB Dataset (Relation Generation)count23.8ENT

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17Mitigating Object Hallucinations via Sentence-Level Early Intervention2025-07-16The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs2025-07-15Seq vs Seq: An Open Suite of Paired Encoders and Decoders2025-07-15Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Exploiting Leaderboards for Large-Scale Distribution of Malicious Models2025-07-11CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs2025-07-09FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation2025-07-09