TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Soft Layer-Specific Multi-Task Summarization with Entailme...

Soft Layer-Specific Multi-Task Summarization with Entailment and Question Generation

Han Guo, Ramakanth Pasunuru, Mohit Bansal

2018-05-28ACL 2018 7Abstractive Text SummarizationMulti-Task LearningQuestion Generation
PaperPDF

Abstract

An accurate abstractive summary of a document should contain all its salient information and should be logically entailed by the input document. We improve these important aspects of abstractive summarization via multi-task learning with the auxiliary tasks of question generation and entailment generation, where the former teaches the summarization model how to look for salient questioning-worthy details, and the latter teaches the model how to rewrite a summary which is a directed-logical subset of the input document. We also propose novel multi-task architectures with high-level (semantic) layer-specific sharing across multiple encoder and decoder layers of the three tasks, as well as soft-sharing mechanisms (and show performance ablations and analysis examples of each contribution). Overall, we achieve statistically significant improvements over the state-of-the-art on both the CNN/DailyMail and Gigaword datasets, as well as on the DUC-2002 transfer setup. We also present several quantitative and qualitative analysis studies of our model's learned saliency and entailment skills.

Results

TaskDatasetMetricValueModel
Text SummarizationGigaWordROUGE-135.98Pointer + Coverage + EntailmentGen + QuestionGen
Text SummarizationGigaWordROUGE-217.76Pointer + Coverage + EntailmentGen + QuestionGen
Text SummarizationGigaWordROUGE-L33.63Pointer + Coverage + EntailmentGen + QuestionGen
Text SummarizationCNN / Daily MailROUGE-139.81Pointer + Coverage + EntailmentGen + QuestionGen
Text SummarizationCNN / Daily MailROUGE-217.64Pointer + Coverage + EntailmentGen + QuestionGen
Text SummarizationCNN / Daily MailROUGE-L36.54Pointer + Coverage + EntailmentGen + QuestionGen
Abstractive Text SummarizationCNN / Daily MailROUGE-139.81Pointer + Coverage + EntailmentGen + QuestionGen
Abstractive Text SummarizationCNN / Daily MailROUGE-217.64Pointer + Coverage + EntailmentGen + QuestionGen
Abstractive Text SummarizationCNN / Daily MailROUGE-L36.54Pointer + Coverage + EntailmentGen + QuestionGen

Related Papers

SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Robust-Multi-Task Gradient Boosting2025-07-15SAMO: A Lightweight Sharpness-Aware Approach for Multi-Task Optimization with Joint Global-Local Perturbation2025-07-10Compressed and Smooth Latent Space for Text Diffusion Modeling2025-06-26Opportunistic Osteoporosis Diagnosis via Texture-Preserving Self-Supervision, Mixture of Experts and Multi-Task Integration2025-06-25AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation2025-06-24An Audio-centric Multi-task Learning Framework for Streaming Ads Targeting on Spotify2025-06-23SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning2025-06-18