Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Gemma-3-27B (10-shot, self-consistency learning)

Gemma-3-27B (10-shot, self-consistency learning)

Reported on 8 benchmarks across 2 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing8 results

Sentiment AnalysisonTASD
F1 (R15)· 2025-02-18
54.37
best: 64.74 (MvP (multi-task))
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044
Sentiment AnalysisonTASD
F1 (R16)· 2025-02-18
66.75
best: 72.76 (MvP)
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044
Sentiment AnalysisonASQP
F1 (R15)· 2025-02-18
39.95
best: 52.21 (MvP (multi-task))
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044
Sentiment AnalysisonASQP
F1 (R16)· 2025-02-18
46.23
best: 60.88 (AugABSA)
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044
Aspect-Based Sentiment Analysis (ABSA)onTASD
F1 (R15)· 2025-02-18
54.37
best: 64.74 (MvP (multi-task))
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044
Aspect-Based Sentiment Analysis (ABSA)onTASD
F1 (R16)· 2025-02-18
66.75
best: 72.76 (MvP)
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044
Aspect-Based Sentiment Analysis (ABSA)onASQP
F1 (R15)· 2025-02-18
39.95
best: 52.21 (MvP (multi-task))
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044
Aspect-Based Sentiment Analysis (ABSA)onASQP
F1 (R16)· 2025-02-18
46.23
best: 60.88 (AugABSA)
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction arXiv:2502.13044