Semantics-aware BERT for Language Understanding

Zhuosheng Zhang, Yuwei Wu, Hai Zhao, Zuchao Li, Shuailiang Zhang, Xi Zhou, Xiang Zhou

2019-09-05

Reading Comprehension, Question Answering, Natural Language Inference, Natural Language Understanding, Word Embeddings, Semantic Role Labeling, Machine Reading Comprehension, Language Modelling

Abstract

The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of successes, especially in various machine reading comprehension and natural language inference tasks. However, existing language representation models, including ELMo, GPT and BERT, only exploit plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information, which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling, and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT keeps the convenient usability of its BERT precursor through light fine-tuning, without substantial task-specific modifications. Compared with BERT, semantics-aware BERT is as simple in concept but more powerful. It obtains new state-of-the-art results or substantially improves results on ten reading comprehension and language inference tasks.
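
The abstract does not spell out how the semantic role labels are combined with the BERT representations. As a purely illustrative sketch, and not the authors' implementation, one simple way to picture "absorbing" semantic role information is to look up an embedding for each token's role tag and concatenate it with that token's contextual vector. Everything named here is a hypothetical stand-in: the tag inventory `SRL_TAGS`, the helpers `make_tag_embeddings` and `fuse`, the random tag embeddings, and the toy token vectors (a real system would take these from a BERT encoder and a pre-trained SRL tagger).

```python
import random

# Hypothetical SRL label inventory (PropBank-style tags); illustrative only.
SRL_TAGS = ["O", "ARG0", "V", "ARG1", "ARGM-TMP"]
TAG_TO_ID = {tag: i for i, tag in enumerate(SRL_TAGS)}


def make_tag_embeddings(num_tags, dim, seed=0):
    """Build a random lookup table standing in for learned tag embeddings."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(num_tags)]


def fuse(token_vectors, srl_tags, tag_embeddings):
    """Concatenate each token's contextual vector with its SRL tag embedding."""
    if len(token_vectors) != len(srl_tags):
        raise ValueError("need exactly one SRL tag per token")
    fused = []
    for vec, tag in zip(token_vectors, srl_tags):
        fused.append(list(vec) + tag_embeddings[TAG_TO_ID[tag]])
    return fused


# Toy stand-ins: fixed 4-dimensional "contextual" vectors and hand-written tags.
tokens = ["She", "opened", "the", "door"]
token_vectors = [[0.1, 0.2, 0.3, 0.4] for _ in tokens]
srl_tags = ["ARG0", "V", "ARG1", "ARG1"]

tag_embeddings = make_tag_embeddings(len(SRL_TAGS), dim=2)
fused = fuse(token_vectors, srl_tags, tag_embeddings)
print(len(fused), len(fused[0]))  # 4 tokens, each of width 4 + 2 = 6
```

The fused vectors would then feed a task-specific output layer. Concatenation is only one possible design; the paper itself should be consulted for the actual integration mechanism.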

Results

Task	Dataset	Metric	Value	Model
Question Answering	SQuAD2.0 dev	EM	80.9	SemBERT large
Question Answering	SQuAD2.0 dev	F1	83.6	SemBERT large
Question Answering	SQuAD2.0	EM	86.166	SemBERT (ensemble)
Question Answering	SQuAD2.0	F1	88.886	SemBERT (ensemble)
Question Answering	SQuAD2.0	EM	84.8	SemBERT (single model)
Question Answering	SQuAD2.0	F1	87.864	SemBERT (single model)
Natural Language Inference	SNLI	% Test Accuracy	91.9	SemBERT
Natural Language Inference	SNLI	% Train Accuracy	94.4	SemBERT
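
For readers unfamiliar with the EM and F1 columns, the following is a simplified sketch of how exact match and token-level F1 are commonly computed for a single predicted answer in SQuAD-style question answering. It is not taken from the paper and is not the official evaluation script; the function names `normalize`, `exact_match`, and `f1_score` and the example strings are illustrative assumptions. Reported scores are such per-question values averaged over the dataset and expressed as percentages.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction, reference):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # An empty answer (e.g. "no answer") only matches another empty answer.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


em = exact_match("the Eiffel Tower in Paris", "Eiffel Tower")
f1 = f1_score("the Eiffel Tower in Paris", "Eiffel Tower")
print(em, round(f1, 3))  # 0.0 0.667
```

In this example the prediction contains both reference tokens plus two extra ones, so recall is 1.0, precision is 0.5, and F1 is about 0.667, while exact match is 0.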
