Clément Rebuffel, Laure Soulier, Geoffrey Scoutheeten, Patrick Gallinari
Transcribing structured data into natural language descriptions has emerged as a challenging task, referred to as "data-to-text". These structures generally group together multiple elements, as well as their attributes. Most attempts rely on translation-style encoder-decoder methods that linearize elements into a sequence, which loses most of the structure contained in the data. In this work, we propose to overcome this limitation with a hierarchical model that encodes the data structure at both the element level and the structure level. Evaluations on RotoWire show the effectiveness of our model with respect to qualitative and quantitative metrics.
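The two-level encoding described above can be sketched in a few lines. This is an illustrative toy, not the paper's architecture: mean pooling stands in for the record-level (element-level) encoder, and a single dot-product self-attention step stands in for the structure-level encoder; the dimensions and entity/record counts are made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy RotoWire-style input: each entity (a player or team) is a set of
# records, each record a (key, value) pair already embedded as a vector.
d = 8
entities = [rng.normal(size=(n_records, d)) for n_records in (4, 6, 3)]

# Element level: encode each entity's records independently of the others
# (mean pooling here is a stand-in for a record-level encoder).
entity_vecs = np.stack([recs.mean(axis=0) for recs in entities])  # (3, d)

# Structure level: contextualize each entity vector against the full set
# (softmax self-attention as a stand-in for the structure-level encoder).
scores = entity_vecs @ entity_vecs.T
weights = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)
context = weights @ entity_vecs  # one structure-aware vector per entity

print(context.shape)
```

The point of the hierarchy is visible in the shapes: records are only mixed within their own entity at the first level, so a flat linearization of all 13 records is never formed, and cross-entity interaction happens only between entity-level summaries at the second level.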
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Data-to-Text Generation | RotoWire (Relation Generation) | count | 21.17 | Hierarchical Transformer Encoder + conditional copy |
| Data-to-Text Generation | RotoWire (Content Ordering) | BLEU | 17.5 | Hierarchical Transformer Encoder + conditional copy |
| Data-to-Text Generation | RotoWire | BLEU | 17.5 | Hierarchical Transformer Encoder + conditional copy |