Neural Attentive Bag-of-Entities Model for Text Classification

Ikuya Yamada, Hiroyuki Shindo

2019-09-03 · CONLL 2019 11 · Text Classification · Question Answering · General Classification · Classification

Abstract

This study proposes the Neural Attentive Bag-of-Entities model, a neural network model that performs text classification using entities in a knowledge base. Entities provide unambiguous and relevant semantic signals that help capture the semantics of a text. We combine simple, high-recall, dictionary-based entity detection with a novel neural attention mechanism that enables the model to focus on a small number of unambiguous and relevant entities in a document. We tested the effectiveness of our model on two standard text classification datasets (the 20 Newsgroups and R8 datasets) and a popular factoid question answering dataset based on a trivia quiz game. Our model achieved state-of-the-art results on all datasets. The source code of the proposed model is available online at https://github.com/wikipedia2vec/wikipedia2vec.
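
The two ingredients named in the abstract, dictionary-based entity detection and attention over the detected entities, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy dictionary, the 2-dimensional embeddings, the function names, and the choice of scaled cosine similarity to a context vector as the attention score are all assumptions made for illustration. The abstract does not specify how attention scores are computed.

```python
import math

# Hypothetical toy dictionary: surface form -> candidate knowledge-base entities.
ENTITY_DICT = {
    "apple": ["Apple_Inc.", "Apple_(fruit)"],
    "iphone": ["IPhone"],
    "new york": ["New_York_City"],
}

# Hypothetical 2-d entity embeddings (real embeddings would be learned).
EMBEDDINGS = {
    "Apple_Inc.": [1.0, 0.0],
    "Apple_(fruit)": [0.0, 1.0],
    "IPhone": [0.9, 0.1],
    "New_York_City": [0.5, 0.5],
}


def detect_entities(text, dictionary, max_len=3):
    """High-recall detection: collect every candidate entity of every
    dictionary n-gram found in the text, without disambiguating."""
    tokens = text.lower().split()
    found = []
    for i in range(len(tokens)):
        for n in range(1, max_len + 1):
            if i + n > len(tokens):
                break
            phrase = " ".join(tokens[i:i + n])
            found.extend(dictionary.get(phrase, []))
    return found


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(entities, embeddings, context, scale=5.0):
    """Weight each entity by softmax(scale * cosine(entity, context)) and
    return the weights plus the attention-weighted sum of entity vectors.
    The scoring function is an assumption, not taken from the paper."""
    scores = [scale * cosine(embeddings[e], context) for e in entities]
    weights = softmax(scores)
    doc_vector = [
        sum(w * embeddings[e][k] for w, e in zip(weights, entities))
        for k in range(len(context))
    ]
    return weights, doc_vector


text = "Apple unveiled the iPhone in New York"
entities = detect_entities(text, ENTITY_DICT)
# The context vector stands in for a representation of the document's words.
weights, doc_vector = attend(entities, EMBEDDINGS, context=[1.0, 0.0])
for entity, weight in zip(entities, weights):
    print(f"{entity}: {weight:.3f}")
```

In this toy setting both senses of "Apple" are detected, and the attention weights favor the candidate whose embedding is closer to the context. That is the intuition behind pairing high-recall detection with attention: ambiguity is tolerated at detection time and resolved softly by the weights. In a trained model, the weighted document vector would feed a classifier layer.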

Results

Task	Dataset	Metric	Value	Model
Text Classification	R8	Accuracy	97.1	NABoE-full
Text Classification	R8	F-measure	91.7	NABoE-full
Text Classification	20NEWS	Accuracy	86.8	NABoE-full
Text Classification	20NEWS	F-measure	86.2	NABoE-full
Classification	R8	Accuracy	97.1	NABoE-full
Classification	R8	F-measure	91.7	NABoE-full
Classification	20NEWS	Accuracy	86.8	NABoE-full
Classification	20NEWS	F-measure	86.2	NABoE-full
