Large language models for aspect-based sentiment analysis

Paul F. Simmering, Paavo Huoviala

2023-10-27Sentiment Analysis Aspect-Based Sentiment Analysis Prompt Engineering Aspect-Based Sentiment Analysis (ABSA)Term Extraction

Paper PDF Code(official)

Abstract

Large language models (LLMs) offer unprecedented text completion capabilities. As general models, they can fulfill a wide range of roles, including those of more specialized models. We assess the performance of GPT-4 and GPT-3.5 in zero shot, few shot and fine-tuned settings on the aspect-based sentiment analysis (ABSA) task. Fine-tuned GPT-3.5 achieves a state-of-the-art F1 score of 83.8 on the joint aspect term extraction and polarity classification task of the SemEval-2014 Task 4, improving upon InstructABSA [@scaria_instructabsa_2023] by 5.7%. However, this comes at the price of 1000 times more model parameters and thus increased inference cost. We discuss the the cost-performance trade-offs of different models, and analyze the typical errors that they make. Our results also indicate that detailed prompts improve performance in zero-shot and few-shot settings but are not necessary for fine-tuned models. This evidence is relevant for practioners that are faced with the choice of prompt engineering versus fine-tuning when using LLMs for ABSA.

Results

Task	Dataset	Metric	Value	Model
Sentiment Analysis	SemEval 2014 Task 4 Subtask 1+2	F1	83.76	gpt-3.5 finetuned
Aspect-Based Sentiment Analysis (ABSA)	SemEval 2014 Task 4 Subtask 1+2	F1	83.76	gpt-3.5 finetuned

Related Papers

AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis2025-07-17 Leveraging Language Prior for Infrared Small Target Detection2025-07-17 Emotional Support with LLM-based Empathetic Dialogue Generation2025-07-17 AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles2025-07-15 DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15 SentiDrop: A Multi Modal Machine Learning model for Predicting Dropout in Distance Learning2025-07-14 Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges2025-07-13 GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation2025-07-10