Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond

Francesco Barbieri, Luis Espinosa Anke, Jose Camacho-Collados

2021-04-25 | LREC 2022 6 | Sentiment Analysis | XLM-R | Language Modelling

Abstract

Language models are ubiquitous in current NLP, and their multilingual capacity has recently attracted considerable attention. However, current analyses have almost exclusively focused on (multilingual variants of) standard benchmarks, and have relied on clean pre-training and task-specific corpora as multilingual signals. In this paper, we introduce XLM-T, a model to train and evaluate multilingual language models in Twitter. We provide: (1) a new strong multilingual baseline consisting of an XLM-R (Conneau et al. 2020) model pre-trained on millions of tweets in over thirty languages, alongside starter code to subsequently fine-tune on a target task; and (2) a set of unified sentiment analysis Twitter datasets in eight different languages and an XLM-T model fine-tuned on them.

Results

Task                Dataset    Metric     Value  Model
Sentiment Analysis  TweetEval  ALL        65.2   RoB-RT
Sentiment Analysis  TweetEval  Emoji      31.4   RoB-RT
Sentiment Analysis  TweetEval  Emotion    79.5   RoB-RT
Sentiment Analysis  TweetEval  Hate       52.3   RoB-RT
Sentiment Analysis  TweetEval  Irony      61.7   RoB-RT
Sentiment Analysis  TweetEval  Offensive  80.5   RoB-RT
Sentiment Analysis  TweetEval  Sentiment  72.6   RoB-RT
Sentiment Analysis  TweetEval  Stance     69.3   RoB-RT

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment (2025-07-21)
AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis (2025-07-17)
Making Language Model a Hierarchical Classifier and Generator (2025-07-17)
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (2025-07-17)
The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations (2025-07-17)
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities (2025-07-17)
Assay2Mol: large language model-based drug design using BioAssay context (2025-07-16)
Describe Anything Model for Visual Question Answering on Text-rich Images (2025-07-16)