Multi-Task Deep Neural Networks for Natural Language Understanding

Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao

2019-01-31ACL 2019 7Paraphrase Identification Sentiment Analysis Natural Language Inference Natural Language Understanding Linguistic Acceptability Language Modelling Domain Adaptation

Paper PDF Code Code Code(official)Code Code Code Code

Abstract

In this paper, we present a Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks. MT-DNN not only leverages large amounts of cross-task data, but also benefits from a regularization effect that leads to more general representations in order to adapt to new tasks and domains. MT-DNN extends the model proposed in Liu et al. (2015) by incorporating a pre-trained bidirectional transformer language model, known as BERT (Devlin et al., 2018). MT-DNN obtains new state-of-the-art results on ten NLU tasks, including SNLI, SciTail, and eight out of nine GLUE tasks, pushing the GLUE benchmark to 82.7% (2.2% absolute improvement). We also demonstrate using the SNLI and SciTail datasets that the representations learned by MT-DNN allow domain adaptation with substantially fewer in-domain labels than the pre-trained BERT representations. The code and pre-trained models are publicly available at https://github.com/namisan/mt-dnn.

Results

Task	Dataset	Metric	Value	Model
Natural Language Inference	SciTail	Accuracy	94.1	MT-DNN
Natural Language Inference	SNLI	% Test Accuracy	91.6	MT-DNN
Natural Language Inference	SNLI	% Train Accuracy	97.2	MT-DNN
Natural Language Inference	SNLI	% Test Accuracy	90.5	Ntumpha
Natural Language Inference	SNLI	% Train Accuracy	99.1	Ntumpha
Natural Language Inference	SNLI	Parameters	220	Ntumpha
Natural Language Inference	MultiNLI	Matched	86.7	MT-DNN
Natural Language Inference	MultiNLI	Mismatched	86	MT-DNN
Semantic Textual Similarity	Quora Question Pairs	Accuracy	89.6	MT-DNN
Semantic Textual Similarity	Quora Question Pairs	F1	72.4	MT-DNN
Sentiment Analysis	SST-2 Binary classification	Accuracy	95.6	MT-DNN
Paraphrase Identification	Quora Question Pairs	Accuracy	89.6	MT-DNN
Paraphrase Identification	Quora Question Pairs	F1	72.4	MT-DNN

Multi-Task Deep Neural Networks for Natural Language Understanding

Abstract

Results

Related Papers

Multi-Task Deep Neural Networks for Natural Language Understanding

Abstract

Results

Related Papers