TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Phase Conductor on Multi-layered Attentions for Machine Co...

Phase Conductor on Multi-layered Attentions for Machine Comprehension

Rui Liu, Wei Wei, Weiguang Mao, Maria Chikina

2017-10-28ICLR 2018 1Reading ComprehensionQuestion Answering
PaperPDF

Abstract

Attention models have been intensively studied to improve NLP tasks such as machine comprehension via both question-aware passage attention model and self-matching attention model. Our research proposes phase conductor (PhaseCond) for attention models in two meaningful ways. First, PhaseCond, an architecture of multi-layered attention models, consists of multiple phases each implementing a stack of attention layers producing passage representations and a stack of inner or outer fusion layers regulating the information flow. Second, we extend and improve the dot-product attention function for PhaseCond by simultaneously encoding multiple question and passage embedding layers from different perspectives. We demonstrate the effectiveness of our proposed model PhaseCond on the SQuAD dataset, showing that our model significantly outperforms both state-of-the-art single-layered and multiple-layered attention models. We deepen our results with new findings via both detailed qualitative analysis and visualized examples showing the dynamic changes through multi-layered attention models.

Results

TaskDatasetMetricValueModel
Question AnsweringSQuAD1.1 devEM72.1PhaseCond (single)
Question AnsweringSQuAD1.1 devF181.4PhaseCond (single)
Question AnsweringSQuAD1.1EM76.996Conductor-net (ensemble)
Question AnsweringSQuAD1.1F184.63Conductor-net (ensemble)
Question AnsweringSQuAD1.1EM74.405Conductor-net (single model)
Question AnsweringSQuAD1.1F182.742Conductor-net (single model)
Question AnsweringSQuAD1.1EM73.24Conductor-net (single)
Question AnsweringSQuAD1.1F181.933Conductor-net (single)

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16Warehouse Spatial Question Answering with LLM Agent2025-07-14Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09