Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral
We introduce InstructABSA, an instruction learning paradigm for Aspect-Based Sentiment Analysis (ABSA) subtasks. Our method introduces positive, negative, and neutral examples to each training sample, and instruction tune the model (Tk-Instruct) for ABSA subtasks, yielding significant performance improvements. Experimental results on the Sem Eval 2014, 15, and 16 datasets demonstrate that InstructABSA outperforms the previous state-of-the-art (SOTA) approaches on Term Extraction (ATE), Sentiment Classification(ATSC) and Sentiment Pair Extraction (ASPE) subtasks. In particular, InstructABSA outperforms the previous state-of-the-art (SOTA) on the Rest14 ATE subtask by 5.69% points, the Rest15 ATSC subtask by 9.59% points, and the Lapt14 AOPE subtask by 3.37% points, surpassing 7x larger models. We also get competitive results on AOOE, AOPE, and AOSTE subtasks indicating strong generalization ability to all subtasks. Exploring sample efficiency reveals that just 50% train data is required to get competitive results with other instruction tuning approaches. Lastly, we assess the quality of instructions and observe that InstructABSA's performance experiences a decline of ~10% when adding misleading examples.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Sentiment Analysis | SemEval 2014 Task 4 Subtask 1+2 | F1 | 79.34 | InstructABSA |
| Sentiment Analysis | SemEval-2014 Task-4 | Laptop (Acc) | 80.56 | InstructABSA |
| Sentiment Analysis | SemEval-2014 Task-4 | Mean Acc (Restaurant + Laptop) | 81.5 | InstructABSA |
| Sentiment Analysis | SemEval-2014 Task-4 | Restaurant (Acc) | 82.44 | InstructABSA |
| Sentiment Analysis | SemEval 2014 Task 4 Subtask 1+2 | F1 | 79.34 | InstructABSA |
| Sentiment Analysis | SemEval 2014 Task 4 Sub Task 1 | Laptop (F1) | 92.3 | InstructABSA |
| Sentiment Analysis | SemEval 2014 Task 4 Sub Task 1 | Restaurant (F1) | 92.76 | InstructABSA |
| Sentiment Analysis | SemEval 2014 Task 4 Laptop | F1 | 79.34 | InstructABSA |
| Sentiment Analysis | SemEval-2014 Task-4 | Laptop (F1) | 92.3 | InstructABSA |
| Sentiment Analysis | SemEval-2014 Task-4 | Mean F1 (Laptop + Restaurant) | 92.53 | InstructABSA |
| Sentiment Analysis | SemEval-2014 Task-4 | Restaurant (F1) | 92.76 | InstructABSA |
| Sentiment Analysis | SemEval 2014 Task 4 Sub Task 1 | Laptop (F1) | 92.3 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Laptop (Acc) | 80.56 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Mean Acc (Restaurant + Laptop) | 81.5 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Restaurant (Acc) | 82.44 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval 2014 Task 4 Subtask 1+2 | F1 | 79.34 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval 2014 Task 4 Sub Task 1 | Laptop (F1) | 92.3 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval 2014 Task 4 Sub Task 1 | Restaurant (F1) | 92.76 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval 2014 Task 4 Laptop | F1 | 79.34 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Laptop (F1) | 92.3 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Mean F1 (Laptop + Restaurant) | 92.53 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Restaurant (F1) | 92.76 | InstructABSA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval 2014 Task 4 Sub Task 1 | Laptop (F1) | 92.3 | InstructABSA |
| Aspect Extraction | SemEval-2014 Task-4 | Laptop (F1) | 92.3 | InstructABSA |
| Aspect Extraction | SemEval-2014 Task-4 | Mean F1 (Laptop + Restaurant) | 92.53 | InstructABSA |
| Aspect Extraction | SemEval-2014 Task-4 | Restaurant (F1) | 92.76 | InstructABSA |
| Aspect Extraction | SemEval 2014 Task 4 Sub Task 1 | Laptop (F1) | 92.3 | InstructABSA |