UCAS-IIE-NLP at SemEval-2023 Task 12: Enhancing Generalization of Multilingual BERT for Low-resource Sentiment Analysis

Dou Hu, Lingwei Wei, Yaxin Liu, Wei Zhou, Songlin Hu

2023-06-01Representation Learning Zero-shot Sentiment Classification Sentiment Analysis Sentiment Classification Contrastive Learning Zero-Shot Learning

Paper PDF Code

Abstract

This paper describes our system designed for SemEval-2023 Task 12: Sentiment analysis for African languages. The challenge faced by this task is the scarcity of labeled data and linguistic resources in low-resource settings. To alleviate these, we propose a generalized multilingual system SACL-XLMR for sentiment analysis on low-resource languages. Specifically, we design a lexicon-based multilingual BERT to facilitate language adaptation and sentiment-aware representation learning. Besides, we apply a supervised adversarial contrastive learning technique to learn sentiment-spread structured representations and enhance model generalization. Our system achieved competitive results, largely outperforming baselines on both multilingual and zero-shot sentiment classification subtasks. Notably, the system obtained the 1st rank on the zero-shot classification subtask in the official ranking. Extensive experiments demonstrate the effectiveness of our system.

Results

Task	Dataset	Metric	Value	Model
Zero-shot Sentiment Classification	AfriSenti	weighted-F1 score	0.589	SACL-XLMR
Zero-shot Sentiment Classification	AfriSenti	weighted-F1 score	0.561	AfroXLMR
Zero-shot Sentiment Classification	AfriSenti	weighted-F1 score	0.439	AfriBERTa
Zero-shot Sentiment Classification	AfriSenti	weighted-F1 score	0.399	XLM-R
Zero-shot Sentiment Classification	AfriSenti	weighted-F1 score	0.34	Random

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20 Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17 Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17 AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis2025-07-17 SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17 HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17 Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17 SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17