Automated Concatenation of Embeddings for Structured Prediction

Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu

2020-10-10ACL 2021 5Structured Prediction Part-Of-Speech Tagging Aspect Extraction Neural Architecture Search Prediction Named Entity Recognition (NER)Chunking Dependency Parsing

Paper PDF Code Code(official)

Abstract

Pretrained contextualized embeddings are powerful word representations for structured prediction tasks. Recent work found that better word representations can be obtained by concatenating different types of embeddings. However, the selection of embeddings to form the best concatenated representation usually varies depending on the task and the collection of candidate embeddings, and the ever-increasing number of embedding types makes it a more difficult problem. In this paper, we propose Automated Concatenation of Embeddings (ACE) to automate the process of finding better concatenations of embeddings for structured prediction tasks, based on a formulation inspired by recent progress on neural architecture search. Specifically, a controller alternately samples a concatenation of embeddings, according to its current belief of the effectiveness of individual embedding types in consideration for a task, and updates the belief based on a reward. We follow strategies in reinforcement learning to optimize the parameters of the controller and compute the reward based on the accuracy of a task model, which is fed with the sampled concatenation as input and trained on a task dataset. Empirical results on 6 tasks and 21 datasets show that our approach outperforms strong baselines and achieves state-of-the-art performance with fine-tuned embeddings in all the evaluations.

Results

Task	Dataset	Metric	Value	Model
Part-Of-Speech Tagging	Ritter	Acc	93.4	ACE
Part-Of-Speech Tagging	ARK	Acc	94.4	ACE
Part-Of-Speech Tagging	Tweebank	Acc	95.8	ACE
Semantic Parsing	DM	In-domain	95.6	ACE
Semantic Parsing	DM	Out-of-domain	92.6	ACE
Semantic Parsing	PSD	In-domain	83.8	ACE
Semantic Parsing	PSD	Out-of-domain	83.4	ACE
Semantic Parsing	PAS	In-domain	95.8	ACE
Semantic Parsing	PAS	Out-of-domain	94.6	ACE
Dependency Parsing	Penn Treebank	LAS	95.8	ACE
Dependency Parsing	Penn Treebank	UAS	97.2	ACE
Sentiment Analysis	SemEval-2014 Task-4	Laptop (F1)	87.4	ACE
Sentiment Analysis	SemEval-2014 Task-4	Restaurant (F1)	92	ACE
Sentiment Analysis	SemEval 2015 Task 12	Restaurant (F1)	80.3	ACE
Named Entity Recognition (NER)	CoNLL 2003 (German)	F1	88.38	ACE + document-context
Named Entity Recognition (NER)	CoNLL 2003 (German)	F1	87	ACE
Named Entity Recognition (NER)	CoNLL 2003 (English)	F1	94.6	ACE + document-context
Named Entity Recognition (NER)	CoNLL 2003 (English)	F1	93.64	ACE
Named Entity Recognition (NER)	CoNLL 2002 (Spanish)	F1	95.9	ACE + document-context
Named Entity Recognition (NER)	CoNLL 2002 (Spanish)	F1	91.7	ACE
Named Entity Recognition (NER)	CoNLL 2002 (Dutch)	F1	95.7	ACE + document-context
Named Entity Recognition (NER)	CoNLL 2002 (Dutch)	F1	94.6	ACE
Named Entity Recognition (NER)	CoNLL 2003 (German) Revised	F1	91.7	ACE + document-context
Named Entity Recognition (NER)	CoNLL 2003 (German) Revised	F1	90.5	ACE
Chunking	CoNLL 2003 (German)	F1	95	ACE
Chunking	Penn Treebank	F1 score	97.3	ACE
Chunking	CoNLL 2000	Exact Span F1	97.3	ACE
Chunking	CoNLL 2003 (English)	F1	92.5	ACE
Aspect-Based Sentiment Analysis (ABSA)	SemEval-2014 Task-4	Laptop (F1)	87.4	ACE
Aspect-Based Sentiment Analysis (ABSA)	SemEval-2014 Task-4	Restaurant (F1)	92	ACE
Aspect-Based Sentiment Analysis (ABSA)	SemEval 2015 Task 12	Restaurant (F1)	80.3	ACE
Shallow Syntax	CoNLL 2003 (German)	F1	95	ACE
Shallow Syntax	Penn Treebank	F1 score	97.3	ACE
Shallow Syntax	CoNLL 2000	Exact Span F1	97.3	ACE
Shallow Syntax	CoNLL 2003 (English)	F1	92.5	ACE
Aspect Extraction	SemEval-2014 Task-4	Laptop (F1)	87.4	ACE
Aspect Extraction	SemEval-2014 Task-4	Restaurant (F1)	92	ACE
Aspect Extraction	SemEval 2015 Task 12	Restaurant (F1)	80.3	ACE

Automated Concatenation of Embeddings for Structured Prediction

Abstract

Results

Related Papers

Automated Concatenation of Embeddings for Structured Prediction

Abstract

Results

Related Papers