TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HTLM: Hyper-Text Pre-Training and Prompting of Language Mo...

HTLM: Hyper-Text Pre-Training and Prompting of Language Models

Armen Aghajanyan, Dmytro Okhonko, Mike Lewis, Mandar Joshi, Hu Xu, Gargi Ghosh, Luke Zettlemoyer

2021-07-14ICLR 2022 4DenoisingData-to-Text GenerationTable-to-Text GenerationLanguage Modelling
PaperPDF

Abstract

We introduce HTLM, a hyper-text language model trained on a large-scale web crawl. Modeling hyper-text has a number of advantages: (1) it is easily gathered at scale, (2) it provides rich document-level and end-task-adjacent supervision (e.g. class and id attributes often encode document category information), and (3) it allows for new structured prompting that follows the established semantics of HTML (e.g. to do zero-shot summarization by infilling title tags for a webpage that contains the input text). We show that pretraining with a BART-style denoising loss directly on simplified HTML provides highly effective transfer for a wide range of end tasks and supervision levels. HTLM matches or exceeds the performance of comparably sized text-only LMs for zero-shot prompting and fine-tuning for classification benchmarks, while also setting new state-of-the-art performance levels for zero-shot summarization. We also find that hyper-text prompts provide more value to HTLM, in terms of data efficiency, than plain text prompts do for existing LMs, and that HTLM is highly effective at auto-prompting itself, by simply generating the most likely hyper-text formatting for any available training data. We will release all code and models to support future HTLM research.

Results

TaskDatasetMetricValueModel
Text GenerationWebNLGBLEU65.4HTML (fine-tuning)
Text GenerationWebNLG FullBLEU56.3HTLM (prefix 0.1%)
Text GenerationWebNLG (Seen)BLEU65.4HTLM (fine-tuning)
Text GenerationWebNLG (Seen)METEOR0.46HTLM (fine-tuning)
Text GenerationWebNLG (Seen)TER0.33HTLM (fine-tuning)
Text GenerationWebNLG (Seen)BLEU65.3GPT-2-Large (fine-tuning)
Text GenerationWebNLG (Seen)METEOR0.46GPT-2-Large (fine-tuning)
Text GenerationWebNLG (Seen)TER0.33GPT-2-Large (fine-tuning)
Text GenerationDARTBERT0.94HTLM (fine-tuning)
Text GenerationDARTBLEU47.2HTLM (fine-tuning)
Text GenerationDARTBLEURT0.4HTLM (fine-tuning)
Text GenerationDARTMETEOR0.39HTLM (fine-tuning)
Text GenerationDARTMover0.51HTLM (fine-tuning)
Text GenerationDARTTER0.44HTLM (fine-tuning)
Text GenerationDARTBERT0.94GPT-2-Large (fine-tuning)
Text GenerationDARTBLEU47GPT-2-Large (fine-tuning)
Text GenerationDARTBLEURT0.4GPT-2-Large (fine-tuning)
Text GenerationDARTMETEOR0.39GPT-2-Large (fine-tuning)
Text GenerationDARTMover0.51GPT-2-Large (fine-tuning)
Text GenerationDARTTER0.46GPT-2-Large (fine-tuning)
Text GenerationWebNLG (All)BLEU55.6HTLM (fine-tuning)
Text GenerationWebNLG (All)METEOR0.42HTLM (fine-tuning)
Text GenerationWebNLG (All)TER0.4HTLM (fine-tuning)
Text GenerationWebNLG (All)BLEU55.5GPT-2-Large (fine-tuning)
Text GenerationWebNLG (All)METEOR0.42GPT-2-Large (fine-tuning)
Text GenerationWebNLG (All)TER0.42GPT-2-Large (fine-tuning)
Text GenerationWebNLG (Unseen)BLEU48.4HTLM (fine-tuning)
Text GenerationWebNLG (Unseen)METEOR0.39HTLM (fine-tuning)
Text GenerationWebNLG (Unseen)TER0.51HTLM (fine-tuning)
Text GenerationWebNLG (Unseen)BLEU43.1GPT-2-Large (fine-tuning)
Text GenerationWebNLG (Unseen)METEOR0.38GPT-2-Large (fine-tuning)
Text GenerationWebNLG (Unseen)TER0.53GPT-2-Large (fine-tuning)
Text GenerationE2EBLEU70.3HTLM (fine-tuning)
Text GenerationE2ECIDEr2.47HTLM (fine-tuning)
Text GenerationE2EMETEOR46.3HTLM (fine-tuning)
Text GenerationE2ENIST8.9HTLM (fine-tuning)
Text GenerationE2EROUGE-L70.8HTLM (fine-tuning)
Text GenerationE2EBLEU68.5GPT-2-Large (fine-tuning)
Text GenerationE2ECIDEr2.45GPT-2-Large (fine-tuning)
Text GenerationE2EMETEOR46GPT-2-Large (fine-tuning)
Text GenerationE2ENIST8.78GPT-2-Large (fine-tuning)
Text GenerationE2EROUGE-L69.9GPT-2-Large (fine-tuning)
Data-to-Text GenerationWebNLGBLEU65.4HTML (fine-tuning)
Data-to-Text GenerationWebNLG FullBLEU56.3HTLM (prefix 0.1%)
Table-to-Text GenerationWebNLG (Seen)BLEU65.4HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (Seen)METEOR0.46HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (Seen)TER0.33HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (Seen)BLEU65.3GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (Seen)METEOR0.46GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (Seen)TER0.33GPT-2-Large (fine-tuning)
Table-to-Text GenerationDARTBERT0.94HTLM (fine-tuning)
Table-to-Text GenerationDARTBLEU47.2HTLM (fine-tuning)
Table-to-Text GenerationDARTBLEURT0.4HTLM (fine-tuning)
Table-to-Text GenerationDARTMETEOR0.39HTLM (fine-tuning)
Table-to-Text GenerationDARTMover0.51HTLM (fine-tuning)
Table-to-Text GenerationDARTTER0.44HTLM (fine-tuning)
Table-to-Text GenerationDARTBERT0.94GPT-2-Large (fine-tuning)
Table-to-Text GenerationDARTBLEU47GPT-2-Large (fine-tuning)
Table-to-Text GenerationDARTBLEURT0.4GPT-2-Large (fine-tuning)
Table-to-Text GenerationDARTMETEOR0.39GPT-2-Large (fine-tuning)
Table-to-Text GenerationDARTMover0.51GPT-2-Large (fine-tuning)
Table-to-Text GenerationDARTTER0.46GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (All)BLEU55.6HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (All)METEOR0.42HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (All)TER0.4HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (All)BLEU55.5GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (All)METEOR0.42GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (All)TER0.42GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (Unseen)BLEU48.4HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (Unseen)METEOR0.39HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (Unseen)TER0.51HTLM (fine-tuning)
Table-to-Text GenerationWebNLG (Unseen)BLEU43.1GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (Unseen)METEOR0.38GPT-2-Large (fine-tuning)
Table-to-Text GenerationWebNLG (Unseen)TER0.53GPT-2-Large (fine-tuning)
Table-to-Text GenerationE2EBLEU70.3HTLM (fine-tuning)
Table-to-Text GenerationE2ECIDEr2.47HTLM (fine-tuning)
Table-to-Text GenerationE2EMETEOR46.3HTLM (fine-tuning)
Table-to-Text GenerationE2ENIST8.9HTLM (fine-tuning)
Table-to-Text GenerationE2EROUGE-L70.8HTLM (fine-tuning)
Table-to-Text GenerationE2EBLEU68.5GPT-2-Large (fine-tuning)
Table-to-Text GenerationE2ECIDEr2.45GPT-2-Large (fine-tuning)
Table-to-Text GenerationE2EMETEOR46GPT-2-Large (fine-tuning)
Table-to-Text GenerationE2ENIST8.78GPT-2-Large (fine-tuning)
Table-to-Text GenerationE2EROUGE-L69.9GPT-2-Large (fine-tuning)

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16