A Fully Attention-Based Information Retriever

Alvaro Henrique Chaim Correia, Jorge Luiz Moreira Silva, Thiago de Castro Martins, Fabio Gagliardi Cozman

2018-10-22Question Answering

Abstract

Recurrent neural networks are now the state-of-the-art in natural language processing because they can build rich contextual representations and process texts of arbitrary length. However, recent developments on attention mechanisms have equipped feedforward networks with similar capabilities, hence enabling faster computations due to the increase in the number of operations that can be parallelized. We explore this new type of architecture in the domain of question-answering and propose a novel approach that we call Fully Attention Based Information Retriever (FABIR). We show that FABIR achieves competitive results in the Stanford Question Answering Dataset (SQuAD) while having fewer parameters and being faster at both learning and inference than rival methods.

Results

Task	Dataset	Metric	Value	Model
Question Answering	SQuAD1.1 dev	EM	65.1	FABIR
Question Answering	SQuAD1.1 dev	F1	75.6	FABIR
Question Answering	SQuAD1.1	EM	67.744	FABIR
Question Answering	SQuAD1.1	F1	77.605	FABIR

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17 Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17 Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17 City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17 Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16 Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16 Warehouse Spatial Question Answering with LLM Agent2025-07-14 Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09