Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

TextBox 2.0: A Text Generation Library with Pre-trained Language Models

Tianyi Tang, Junyi Li, Zhipeng Chen, Yiwen Hu, Zhuohao Yu, Wenxun Dai, Zican Dong, Xiaoxue Cheng, Yuhao Wang, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

2022-12-26
Tasks: Machine Translation, Question Answering, Data-to-Text Generation, Text Generation, Style Transfer, Abstractive Text Summarization, Story Generation, Task-Oriented Dialogue Systems, Question Generation, Text Simplification

Abstract

To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2.0, focusing on the use of pre-trained language models (PLMs). To be comprehensive, our library covers 13 common text generation tasks and their corresponding 83 datasets and further incorporates 45 PLMs covering general, translation, Chinese, dialogue, controllable, distilled, prompting, and lightweight PLMs. We also implement 4 efficient training strategies and provide 4 generation objectives for pre-training new PLMs from scratch. To be unified, we design the interfaces to support the entire research pipeline (from data loading to training and evaluation), ensuring that each step can be fulfilled in a unified way. Despite the rich functionality, it is easy to use our library, either through the friendly Python API or command line. To validate the effectiveness of our library, we conduct extensive experiments and exemplify four types of research scenarios. The project is released at the link: https://github.com/RUCAIBox/TextBox.

Results

Task | Dataset | Metric | Value | Model
Sketch | GYAFC | Accuracy | 94.37 | BART (TextBox 2.0)
Sketch | GYAFC | BLEU-4 | 76.93 | BART (TextBox 2.0)
Sketch | GYAFC | Harmonic mean | 84.74 | BART (TextBox 2.0)
Dialogue | Persona-Chat | BLEU-1 | 49.581 | BART (TextBox 2.0)
Dialogue | Persona-Chat | BLEU-2 | 39.24 | BART (TextBox 2.0)
Dialogue | Persona-Chat | Distinct-1 | 1.44 | BART (TextBox 2.0)
Dialogue | Persona-Chat | Distinct-2 | 8.89 | BART (TextBox 2.0)
Dialogue | MULTIWOZ 2.0 | BLEU-4 | 20.17 | BART (TextBox 2.0)
Dialogue | MULTIWOZ 2.0 | Score | 100.07 | BART (TextBox 2.0)
Machine Translation | WMT2016 Romanian-English | BLEU-4 | 37.48 | BART (TextBox 2.0)
Machine Translation | WMT2016 English-Romanian | BLEU-4 | 37.2 | BART (TextBox 2.0)
Style Transfer | GYAFC | Accuracy | 94.37 | BART (TextBox 2.0)
Style Transfer | GYAFC | BLEU-4 | 76.93 | BART (TextBox 2.0)
Style Transfer | GYAFC | Harmonic mean | 84.74 | BART (TextBox 2.0)
Question Answering | SQuAD1.1 | Exact Match | 86.44 | BART (TextBox 2.0)
Question Answering | SQuAD1.1 | F1 | 93.04 | BART (TextBox 2.0)
Text Generation | ADGEN | BLEU-4 | 10.2 | BART (TextBox 2.0)
Text Generation | CSL | ROUGE-L | 64.34 | BART (TextBox 2.0)
Text Generation | LCSTS | ROUGE-L | 42.96 | BART (TextBox 2.0)
Text Generation | CommonGen | BLEU-4 | 28.18 | BART (TextBox 2.0)
Text Generation | CommonGen | CIDEr | 12.98 | BART (TextBox 2.0)
Text Generation | CommonGen | SPICE | 33 | BART (TextBox 2.0)
Text Generation | WebNLG | BLEU-4 | 67.33 | BART (TextBox 2.0)
Text Generation | WebNLG | METEOR | 47.78 | BART (TextBox 2.0)
Text Generation | WebNLG | ROUGE-L | 76.83 | BART (TextBox 2.0)
Text Generation | WritingPrompts | BLEU-1 | 33.79 | BART (TextBox 2.0)
Text Generation | WritingPrompts | BLEU-2 | 15.78 | BART (TextBox 2.0)
Text Generation | WritingPrompts | Distinct-4 | 78.762 | BART (TextBox 2.0)
Text Simplification | Wiki-Auto + Turk | BLEU-4 | 90.81 | BART (TextBox 2.0)
Text Simplification | Wiki-Auto + Turk | METEOR | 57.58 | BART (TextBox 2.0)
Text Simplification | Wiki-Auto + Turk | ROUGE-2 | 83.36 | BART (TextBox 2.0)
Text Summarization | CNN/Daily Mail | ROUGE-1 | 44.47 | BART (TextBox 2.0)
Text Summarization | CNN/Daily Mail | ROUGE-2 | 21.5 | BART (TextBox 2.0)
Text Summarization | CNN/Daily Mail | ROUGE-L | 41.35 | BART (TextBox 2.0)
Abstractive Text Summarization | CNN/Daily Mail | ROUGE-1 | 44.47 | BART (TextBox 2.0)
Abstractive Text Summarization | CNN/Daily Mail | ROUGE-2 | 21.5 | BART (TextBox 2.0)
Abstractive Text Summarization | CNN/Daily Mail | ROUGE-L | 41.35 | BART (TextBox 2.0)
Data-to-Text Generation | WebNLG | BLEU-4 | 67.33 | BART (TextBox 2.0)
Data-to-Text Generation | WebNLG | METEOR | 47.78 | BART (TextBox 2.0)
Data-to-Text Generation | WebNLG | ROUGE-L | 76.83 | BART (TextBox 2.0)
Question Generation | SQuAD1.1 | BLEU-4 | 25.08 | BART (TextBox 2.0)
Question Generation | SQuAD1.1 | METEOR | 26.73 | BART (TextBox 2.0)
Question Generation | SQuAD1.1 | ROUGE-L | 52.55 | BART (TextBox 2.0)
2D Human Pose Estimation | GYAFC | Accuracy | 94.37 | BART (TextBox 2.0)
2D Human Pose Estimation | GYAFC | BLEU-4 | 76.93 | BART (TextBox 2.0)
2D Human Pose Estimation | GYAFC | Harmonic mean | 84.74 | BART (TextBox 2.0)
2D Classification | GYAFC | Accuracy | 94.37 | BART (TextBox 2.0)
2D Classification | GYAFC | BLEU-4 | 76.93 | BART (TextBox 2.0)
2D Classification | GYAFC | Harmonic mean | 84.74 | BART (TextBox 2.0)
Task-Oriented Dialogue Systems | MULTIWOZ 2.0 | BLEU-4 | 20.17 | BART (TextBox 2.0)
Task-Oriented Dialogue Systems | MULTIWOZ 2.0 | Score | 100.07 | BART (TextBox 2.0)
Story Generation | WritingPrompts | BLEU-1 | 33.79 | BART (TextBox 2.0)
Story Generation | WritingPrompts | BLEU-2 | 15.78 | BART (TextBox 2.0)
Story Generation | WritingPrompts | Distinct-4 | 78.762 | BART (TextBox 2.0)
1 Image, 2*2 Stitchi | GYAFC | Accuracy | 94.37 | BART (TextBox 2.0)
1 Image, 2*2 Stitchi | GYAFC | BLEU-4 | 76.93 | BART (TextBox 2.0)
1 Image, 2*2 Stitchi | GYAFC | Harmonic mean | 84.74 | BART (TextBox 2.0)
Drawing Pictures | GYAFC | Accuracy | 94.37 | BART (TextBox 2.0)
Drawing Pictures | GYAFC | BLEU-4 | 76.93 | BART (TextBox 2.0)
Drawing Pictures | GYAFC | Harmonic mean | 84.74 | BART (TextBox 2.0)
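
The Distinct-1, Distinct-2, and Distinct-4 entries in the results measure lexical diversity, which is less self-explanatory than BLEU or ROUGE. The sketch below shows the common corpus-level definition: the number of unique n-grams divided by the total number of n-grams across all generated texts. It is an illustration only, not TextBox's implementation. The function name `distinct_n`, the whitespace tokenization, and the scaling to a percentage are assumptions made here, and the library may tokenize or aggregate differently.

```python
def distinct_n(sentences, n):
    """Corpus-level Distinct-n as a percentage (illustrative sketch).

    sentences: list of generated strings.
    n: n-gram order (1 for Distinct-1, 2 for Distinct-2, ...).
    Assumes simple whitespace tokenization, which is a simplification.
    """
    total = 0          # total number of n-grams over the whole corpus
    unique = set()     # distinct n-grams seen so far
    for sentence in sentences:
        tokens = sentence.split()
        # Sentences shorter than n tokens contribute no n-grams.
        ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        total += len(ngrams)
        unique.update(ngrams)
    # Guard against an empty corpus to avoid division by zero.
    return 100.0 * len(unique) / total if total else 0.0


if __name__ == "__main__":
    outputs = ["i am fine", "i am good"]
    print(distinct_n(outputs, 1))  # 4 unique unigrams out of 6
    print(distinct_n(outputs, 2))  # 3 unique bigrams out of 4
```

Under this definition, a low Distinct-1 means the outputs reuse a small vocabulary, while higher-order scores such as Distinct-4 tend to be larger because longer n-grams repeat less often.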
