Efficient Test Time Adapter Ensembling for Low-resource Language Varieties

Xinyi Wang, Yulia Tsvetkov, Sebastian Ruder, Graham Neubig

2021-09-10 · Findings of EMNLP 2021
Tasks: Part-of-Speech Tagging · Named Entity Recognition (NER) · Cross-Lingual Transfer · Parameter-Efficient Fine-Tuning

Paper · PDF · Code (official)

Abstract

Adapters are light-weight modules that allow parameter-efficient fine-tuning of pretrained models. Specialized language and task adapters have recently been proposed to facilitate cross-lingual transfer of multilingual pretrained models (Pfeiffer et al., 2020b). However, this approach requires training a separate language adapter for every language one wishes to support, which can be impractical for languages with limited data. An intuitive solution is to use a related language adapter for the new language variety, but we observe that this solution can lead to sub-optimal performance. In this paper, we aim to improve the robustness of language adapters to uncovered languages without training new adapters. We find that ensembling multiple existing language adapters makes the fine-tuned model significantly more robust to other language varieties not included in these adapters. Building upon this observation, we propose Entropy Minimized Ensemble of Adapters (EMEA), a method that optimizes the ensemble weights of the pretrained language adapters for each test sentence by minimizing the entropy of its predictions. Experiments on three diverse groups of language varieties show that our method leads to significant improvements on both named entity recognition and part-of-speech tagging across all languages.
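The abstract fully specifies the test-time procedure: form a weighted ensemble of frozen, pretrained language adapters and, for each test sentence, take a few gradient steps on the ensemble weights to minimize the entropy of the model's predictions. Below is a minimal PyTorch sketch of that idea under toy assumptions; ToyAdapter, ToyTagger, emea_weights, and the hyperparameters (10 steps, learning rate 0.1) are illustrative stand-ins chosen for this sketch, not the authors' implementation (see the official code linked above).

```python
# Minimal EMEA-style sketch: optimize per-sentence ensemble weights over
# frozen adapters by minimizing prediction entropy. All classes here are
# toy stand-ins for a real multilingual encoder with language adapters.
import torch


class ToyAdapter(torch.nn.Module):
    """Stand-in for a pretrained language adapter (residual bottleneck MLP)."""

    def __init__(self, dim: int, bottleneck: int = 16):
        super().__init__()
        self.down = torch.nn.Linear(dim, bottleneck)
        self.up = torch.nn.Linear(bottleneck, dim)

    def forward(self, h):
        return h + self.up(torch.relu(self.down(h)))


class ToyTagger(torch.nn.Module):
    """Stand-in encoder output plus tagging head with an adapter ensemble."""

    def __init__(self, dim: int, num_tags: int, adapters):
        super().__init__()
        self.adapters = torch.nn.ModuleList(adapters)
        self.head = torch.nn.Linear(dim, num_tags)

    def forward(self, h, weights):
        # Weighted combination of the adapters' outputs (the ensemble).
        mixed = sum(w * a(h) for w, a in zip(weights, self.adapters))
        return self.head(mixed)  # (batch, seq_len, num_tags) logits


def emea_weights(model, h, num_adapters, steps=10, lr=0.1):
    """Fit ensemble weights for one test sentence by entropy minimization."""
    for p in model.parameters():          # model and adapters stay frozen;
        p.requires_grad_(False)           # only the ensemble weights move
    alpha = torch.zeros(num_adapters, requires_grad=True)  # pre-softmax
    opt = torch.optim.SGD([alpha], lr=lr)
    for _ in range(steps):
        probs = torch.softmax(model(h, torch.softmax(alpha, 0)), dim=-1)
        entropy = -(probs * torch.log(probs + 1e-9)).sum(-1).mean()
        opt.zero_grad()
        entropy.backward()
        opt.step()
    return torch.softmax(alpha.detach(), 0)


# Usage on a single (here random) test sentence of 7 tokens.
dim, num_tags, n_adapters = 32, 5, 3
model = ToyTagger(dim, num_tags, [ToyAdapter(dim) for _ in range(n_adapters)])
h = torch.randn(1, 7, dim)               # hidden states for the sentence
w = emea_weights(model, h, n_adapters)   # adapted ensemble weights
tags = model(h, w).argmax(-1)            # tag predictions with tuned weights
```

Parameterizing the weights pre-softmax keeps the ensemble normalized throughout optimization, and because only those few scalars are updated per sentence, no new adapter has to be trained for the uncovered language variety.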

Related Papers

- Enhancing Cross-task Transfer of Large Language Models via Activation Steering (2025-07-17)
- Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy (2025-07-17)
- HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training (2025-07-15)
- Flippi: End To End GenAI Assistant for E-Commerce (2025-07-08)
- LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization (2025-07-06)
- Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models (2025-06-28)
- Exploring Adapter Design Tradeoffs for Low Resource Music Generation (2025-06-26)
- WordCon: Word-level Typography Control in Scene Text Rendering (2025-06-26)