Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.




Stochastic Answer Networks for Machine Reading Comprehension

Xiaodong Liu, Yelong Shen, Kevin Duh, Jianfeng Gao

Published 10 December 2017 · ACL 2018

Tasks: Reading Comprehension, Question Answering, Reinforcement Learning, Machine Reading Comprehension

Abstract

We propose a simple yet robust stochastic answer network (SAN) that simulates multi-step reasoning in machine reading comprehension. Compared to previous work such as ReasoNet which used reinforcement learning to determine the number of steps, the unique feature is the use of a kind of stochastic prediction dropout on the answer module (final layer) of the neural network during the training. We show that this simple trick improves robustness and achieves results competitive to the state-of-the-art on the Stanford Question Answering Dataset (SQuAD), the Adversarial SQuAD, and the Microsoft MAchine Reading COmprehension Dataset (MS MARCO).
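The core trick the abstract describes is stochastic prediction dropout: the answer module produces a prediction at each of several reasoning steps, and during training whole steps are randomly dropped before the surviving predictions are averaged; at inference, all steps are averaged. A minimal NumPy sketch of that idea (the function name, array layout, and drop rate are illustrative assumptions, not the paper's implementation):

```python
import numpy as np

def stochastic_prediction_dropout(step_predictions, drop_rate=0.4,
                                  training=True, rng=None):
    """Average per-step answer distributions, randomly dropping whole
    reasoning steps during training (stochastic prediction dropout).

    step_predictions: shape (T, n_classes), one softmax distribution per
    reasoning step. All names here are illustrative, not from the paper.
    """
    rng = rng or np.random.default_rng()
    preds = np.asarray(step_predictions, dtype=float)
    if training:
        # Keep each step independently with probability (1 - drop_rate),
        # making sure at least one step survives.
        keep = rng.random(len(preds)) >= drop_rate
        if not keep.any():
            keep[rng.integers(len(preds))] = True
        preds = preds[keep]
    # The final answer distribution is the mean over surviving steps.
    return preds.mean(axis=0)
```

At inference (`training=False`) this reduces to a plain average over all steps, which is what makes the trained model robust to any single step's prediction being off.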

Results

Task               | Dataset      | Metric | Value  | Model
Question Answering | SQuAD1.1 dev | EM     | 76.235 | SAN (single)
Question Answering | SQuAD1.1 dev | F1     | 84.056 | SAN (single)
Question Answering | SQuAD1.1     | EM     | 79.608 | SAN (ensemble model)
Question Answering | SQuAD1.1     | F1     | 86.496 | SAN (ensemble model)
Question Answering | SQuAD1.1     | EM     | 76.828 | SAN (single model)
Question Answering | SQuAD1.1     | F1     | 84.396 | SAN (single model)
Question Answering | SQuAD2.0     | EM     | 71.316 | SAN (ensemble model)
Question Answering | SQuAD2.0     | F1     | 73.704 | SAN (ensemble model)
Question Answering | SQuAD2.0     | EM     | 68.653 | SAN (single model)
Question Answering | SQuAD2.0     | F1     | 71.439 | SAN (single model)
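EM and F1 in the table are the standard SQuAD metrics: exact match of the predicted answer string, and token-level overlap F1. A simplified sketch of both (the official evaluation script additionally normalizes punctuation and articles, which is omitted here):

```python
from collections import Counter

def exact_match(pred, gold):
    # SQuAD-style EM: 1 if the answers match after simple normalization.
    return int(pred.strip().lower() == gold.strip().lower())

def f1_score(pred, gold):
    # Token-level F1 between predicted and gold answer spans.
    pred_toks = pred.lower().split()
    gold_toks = gold.lower().split()
    common = Counter(pred_toks) & Counter(gold_toks)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_toks)
    recall = overlap / len(gold_toks)
    return 2 * precision * recall / (precision + recall)
```

Both metrics are averaged over all questions, which is how the dev/test percentages above are obtained.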

Related Papers

- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning (2025-07-18)
- From Roots to Rewards: Dynamic Tree Reasoning with RL (2025-07-17)
- Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering (2025-07-17)
- Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It (2025-07-17)
- City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning (2025-07-17)
- VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (2025-07-17)
- Spectral Bellman Method: Unifying Representation and Exploration in RL (2025-07-17)
- Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback (2025-07-17)