Zhibin Gou, Qingyan Guo, Yujiu Yang
Generative methods greatly promote aspect-based sentiment analysis via generating a sequence of sentiment elements in a specified format. However, existing studies usually predict sentiment elements in a fixed order, which ignores the effect of the interdependence of the elements in a sentiment tuple and the diversity of language expression on the results. In this work, we propose Multi-view Prompting (MvP) that aggregates sentiment elements generated in different orders, leveraging the intuition of human-like problem-solving processes from different views. Specifically, MvP introduces element order prompts to guide the language model to generate multiple sentiment tuples, each with a different element order, and then selects the most reasonable tuples by voting. MvP can naturally model multi-view and multi-task as permutations and combinations of elements, respectively, outperforming previous task-specific designed methods on multiple ABSA tasks with a single model. Extensive experiments show that MvP significantly advances the state-of-the-art performance on 10 datasets of 4 benchmark tasks, and performs quite effectively in low-resource settings. Detailed evaluation verified the effectiveness, flexibility, and cross-task transferability of MvP.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Sentiment Analysis | TASD | F1 (R15) | 64.74 | MvP (multi-task) |
| Sentiment Analysis | TASD | F1 (R16) | 70.18 | MvP (multi-task) |
| Sentiment Analysis | TASD | F1 (R15) | 64.53 | MvP |
| Sentiment Analysis | TASD | F1 (R16) | 72.76 | MvP |
| Sentiment Analysis | TASD | F1 (R16) | 46.51 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Sentiment Analysis | TASD | F1 (R16) | 34.08 | ChatGPT (gpt-3.5-turbo, zero-shot) |
| Sentiment Analysis | ASTE | F1 (L14) | 65.3 | MvP (multi-task) |
| Sentiment Analysis | ASTE | F1 (R15) | 69.44 | MvP (multi-task) |
| Sentiment Analysis | ASTE | F1 (R16) | 73.1 | MvP (multi-task) |
| Sentiment Analysis | ASTE | F1(R14) | 76.3 | MvP (multi-task) |
| Sentiment Analysis | ASTE | F1 (L14) | 63.33 | MvP |
| Sentiment Analysis | ASTE | F1 (R15) | 65.89 | MvP |
| Sentiment Analysis | ASTE | F1 (R16) | 73.48 | MvP |
| Sentiment Analysis | ASTE | F1(R14) | 74.05 | MvP |
| Sentiment Analysis | ASTE | F1 (L14) | 38.12 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Sentiment Analysis | ASTE | F1 (L14) | 36.05 | ChatGPT (gpt-3.5-turbo, zero-shot) |
| Sentiment Analysis | ACOS | F1 (Laptop) | 43.92 | MvP |
| Sentiment Analysis | ACOS | F1 (Restaurant) | 61.54 | MvP |
| Sentiment Analysis | ACOS | F1 (Laptop) | 43.84 | MvP (muilti-task) |
| Sentiment Analysis | ACOS | F1 (Restaurant) | 60.36 | MvP (muilti-task) |
| Sentiment Analysis | ACOS | F1 (Restaurant) | 37.71 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Sentiment Analysis | ACOS | F1 (Restaurant) | 27.11 | ChatGPT (gpt-3.5-turbo, zero-shot) |
| Sentiment Analysis | ASQP | F1 (R15) | 52.21 | MvP (multi-task) |
| Sentiment Analysis | ASQP | F1 (R16) | 58.94 | MvP (multi-task) |
| Sentiment Analysis | ASQP | F1 (R15) | 51.04 | MvP |
| Sentiment Analysis | ASQP | F1 (R16) | 60.39 | MvP |
| Sentiment Analysis | ASQP | F1 (R15) | 34.27 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Sentiment Analysis | ASQP | F1 (R15) | 22.87 | ChatGPT (gpt-3.5-turbo, zero-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | TASD | F1 (R15) | 64.74 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | TASD | F1 (R16) | 70.18 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | TASD | F1 (R15) | 64.53 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | TASD | F1 (R16) | 72.76 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | TASD | F1 (R16) | 46.51 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | TASD | F1 (R16) | 34.08 | ChatGPT (gpt-3.5-turbo, zero-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (L14) | 65.3 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (R15) | 69.44 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (R16) | 73.1 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1(R14) | 76.3 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (L14) | 63.33 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (R15) | 65.89 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (R16) | 73.48 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1(R14) | 74.05 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (L14) | 38.12 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | ASTE | F1 (L14) | 36.05 | ChatGPT (gpt-3.5-turbo, zero-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | ACOS | F1 (Laptop) | 43.92 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ACOS | F1 (Restaurant) | 61.54 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ACOS | F1 (Laptop) | 43.84 | MvP (muilti-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ACOS | F1 (Restaurant) | 60.36 | MvP (muilti-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ACOS | F1 (Restaurant) | 37.71 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | ACOS | F1 (Restaurant) | 27.11 | ChatGPT (gpt-3.5-turbo, zero-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | ASQP | F1 (R15) | 52.21 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ASQP | F1 (R16) | 58.94 | MvP (multi-task) |
| Aspect-Based Sentiment Analysis (ABSA) | ASQP | F1 (R15) | 51.04 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ASQP | F1 (R16) | 60.39 | MvP |
| Aspect-Based Sentiment Analysis (ABSA) | ASQP | F1 (R15) | 34.27 | ChatGPT (gpt-3.5-turbo, few-shot) |
| Aspect-Based Sentiment Analysis (ABSA) | ASQP | F1 (R15) | 22.87 | ChatGPT (gpt-3.5-turbo, zero-shot) |