Alexander Rietzler, Sebastian Stabinger, Paul Opitz, Stefan Engl
Aspect-Target Sentiment Classification (ATSC) is a subtask of Aspect-Based Sentiment Analysis (ABSA), which has many applications e.g. in e-commerce, where data and insights from reviews can be leveraged to create value for businesses and customers. Recently, deep transfer-learning methods have been applied successfully to a myriad of Natural Language Processing (NLP) tasks, including ATSC. Building on top of the prominent BERT language model, we approach ATSC using a two-step procedure: self-supervised domain-specific BERT language model finetuning, followed by supervised task-specific finetuning. Our findings on how to best exploit domain-specific language model finetuning enable us to produce new state-of-the-art performance on the SemEval 2014 Task 4 restaurants dataset. In addition, to explore the real-world robustness of our models, we perform cross-domain evaluation. We show that a cross-domain adapted BERT language model performs significantly better than strong baseline models like vanilla BERT-base and XLNet-base. Finally, we conduct a case study to interpret model prediction errors.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Sentiment Analysis | SemEval-2014 Task-4 | Laptop (Acc) | 80.23 | BERT-ADA |
| Sentiment Analysis | SemEval-2014 Task-4 | Mean Acc (Restaurant + Laptop) | 84.06 | BERT-ADA |
| Sentiment Analysis | SemEval-2014 Task-4 | Restaurant (Acc) | 87.89 | BERT-ADA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Laptop (Acc) | 80.23 | BERT-ADA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Mean Acc (Restaurant + Laptop) | 84.06 | BERT-ADA |
| Aspect-Based Sentiment Analysis (ABSA) | SemEval-2014 Task-4 | Restaurant (Acc) | 87.89 | BERT-ADA |