LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto

2020-10-02EMNLP 2020 11Question Answering Relation Extraction Common Sense Reasoning Extractive Question-Answering Named Entity Recognition Relation Classification Named Entity Recognition (NER)Entity Typing Language Modelling

Paper PDF Code Code Code Code Code Code(official)Code Code Code

Abstract

Entity representations are useful in natural language tasks involving entities. In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer. The proposed model treats words and entities in a given text as independent tokens, and outputs contextualized representations of them. Our model is trained using a new pretraining task based on the masked language model of BERT. The task involves predicting randomly masked words and entities in a large entity-annotated corpus retrieved from Wikipedia. We also propose an entity-aware self-attention mechanism that is an extension of the self-attention mechanism of the transformer, and considers the types of tokens (words or entities) when computing attention scores. The proposed model achieves impressive empirical performance on a wide range of entity-related tasks. In particular, it obtains state-of-the-art results on five well-known datasets: Open Entity (entity typing), TACRED (relation classification), CoNLL-2003 (named entity recognition), ReCoRD (cloze-style question answering), and SQuAD 1.1 (extractive question answering). Our source code and pretrained representations are available at https://github.com/studio-ousia/luke.

Results

Task	Dataset	Metric	Value	Model
Relation Extraction	TACRED	F1 (1% Few-Shot)	17	LUKE
Relation Extraction	TACRED	F1 (5% Few-Shot)	51.6	LUKE
Relation Extraction	TACRED	F1	72.7	LUKE 483M
Relation Classification	TACRED	F1	72.7	LUKE 483M
Question Answering	SQuAD1.1 dev	EM	89.8	LUKE
Question Answering	SQuAD1.1 dev	F1	95	LUKE 483M
Question Answering	SQuAD1.1	EM	90.202	LUKE (single model)
Question Answering	SQuAD1.1	F1	95.379	LUKE (single model)
Question Answering	SQuAD1.1	EM	90.202	LUKE (single model)
Question Answering	SQuAD1.1	F1	95.379	LUKE (single model)
Question Answering	SQuAD1.1	EM	90.2	LUKE
Question Answering	SQuAD1.1	F1	95.4	LUKE 483M
Question Answering	SQuAD2.0	EM	87.429	LUKE (single model)
Question Answering	SQuAD2.0	F1	90.163	LUKE (single model)
Question Answering	SQuAD2.0	EM	87.429	LUKE (single model)
Question Answering	SQuAD2.0	F1	90.163	LUKE (single model)
Question Answering	SQuAD2.0	F1	90.2	LUKE 483M
Common Sense Reasoning	ReCoRD	EM	90.6	LUKE 483M
Common Sense Reasoning	ReCoRD	F1	91.2	LUKE 483M
Named Entity Recognition (NER)	CoNLL 2003 (English)	F1	94.3	LUKE 483M
Named Entity Recognition (NER)	CoNLL++	F1	95.89	LUKE（Large）
Entity Typing	Open Entity	F1	78.2	MLMET

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Abstract

Results

Related Papers

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Abstract

Results

Related Papers