TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Automated Concatenation of Embeddings for Structured Predi...

Automated Concatenation of Embeddings for Structured Prediction

Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu

2020-10-10ACL 2021 5Structured PredictionPart-Of-Speech TaggingAspect ExtractionNeural Architecture SearchPredictionNamed Entity Recognition (NER)ChunkingDependency Parsing
PaperPDFCodeCode(official)

Abstract

Pretrained contextualized embeddings are powerful word representations for structured prediction tasks. Recent work found that better word representations can be obtained by concatenating different types of embeddings. However, the selection of embeddings to form the best concatenated representation usually varies depending on the task and the collection of candidate embeddings, and the ever-increasing number of embedding types makes it a more difficult problem. In this paper, we propose Automated Concatenation of Embeddings (ACE) to automate the process of finding better concatenations of embeddings for structured prediction tasks, based on a formulation inspired by recent progress on neural architecture search. Specifically, a controller alternately samples a concatenation of embeddings, according to its current belief of the effectiveness of individual embedding types in consideration for a task, and updates the belief based on a reward. We follow strategies in reinforcement learning to optimize the parameters of the controller and compute the reward based on the accuracy of a task model, which is fed with the sampled concatenation as input and trained on a task dataset. Empirical results on 6 tasks and 21 datasets show that our approach outperforms strong baselines and achieves state-of-the-art performance with fine-tuned embeddings in all the evaluations.

Results

TaskDatasetMetricValueModel
Part-Of-Speech TaggingRitterAcc93.4ACE
Part-Of-Speech TaggingARKAcc94.4ACE
Part-Of-Speech TaggingTweebankAcc95.8ACE
Semantic ParsingDMIn-domain95.6ACE
Semantic ParsingDMOut-of-domain92.6ACE
Semantic ParsingPSDIn-domain83.8ACE
Semantic ParsingPSDOut-of-domain83.4ACE
Semantic ParsingPASIn-domain95.8ACE
Semantic ParsingPASOut-of-domain94.6ACE
Dependency ParsingPenn TreebankLAS95.8ACE
Dependency ParsingPenn TreebankUAS97.2ACE
Sentiment AnalysisSemEval-2014 Task-4Laptop (F1)87.4ACE
Sentiment AnalysisSemEval-2014 Task-4Restaurant (F1)92ACE
Sentiment Analysis SemEval 2015 Task 12Restaurant (F1)80.3ACE
Named Entity Recognition (NER)CoNLL 2003 (German)F188.38ACE + document-context
Named Entity Recognition (NER)CoNLL 2003 (German)F187ACE
Named Entity Recognition (NER)CoNLL 2003 (English)F194.6ACE + document-context
Named Entity Recognition (NER)CoNLL 2003 (English)F193.64ACE
Named Entity Recognition (NER)CoNLL 2002 (Spanish)F195.9ACE + document-context
Named Entity Recognition (NER)CoNLL 2002 (Spanish)F191.7ACE
Named Entity Recognition (NER)CoNLL 2002 (Dutch)F195.7ACE + document-context
Named Entity Recognition (NER)CoNLL 2002 (Dutch)F194.6ACE
Named Entity Recognition (NER)CoNLL 2003 (German) RevisedF191.7ACE + document-context
Named Entity Recognition (NER)CoNLL 2003 (German) RevisedF190.5ACE
ChunkingCoNLL 2003 (German)F195ACE
ChunkingPenn TreebankF1 score97.3ACE
ChunkingCoNLL 2000Exact Span F197.3ACE
ChunkingCoNLL 2003 (English)F192.5ACE
Aspect-Based Sentiment Analysis (ABSA)SemEval-2014 Task-4Laptop (F1)87.4ACE
Aspect-Based Sentiment Analysis (ABSA)SemEval-2014 Task-4Restaurant (F1)92ACE
Aspect-Based Sentiment Analysis (ABSA) SemEval 2015 Task 12Restaurant (F1)80.3ACE
Shallow SyntaxCoNLL 2003 (German)F195ACE
Shallow SyntaxPenn TreebankF1 score97.3ACE
Shallow SyntaxCoNLL 2000Exact Span F197.3ACE
Shallow SyntaxCoNLL 2003 (English)F192.5ACE
Aspect ExtractionSemEval-2014 Task-4Laptop (F1)87.4ACE
Aspect ExtractionSemEval-2014 Task-4Restaurant (F1)92ACE
Aspect Extraction SemEval 2015 Task 12Restaurant (F1)80.3ACE

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21DASViT: Differentiable Architecture Search for Vision Transformer2025-07-17Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11Dynamic Chunking for End-to-End Hierarchical Sequence Modeling2025-07-10Foundation models for time series forecasting: Application in conformal prediction2025-07-09CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs2025-07-09Predicting Graph Structure via Adapted Flux Balance Analysis2025-07-08