TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Gemma-3-27B (10-shot, self-consistency learning)

Gemma-3-27B (10-shot, self-consistency learning)

Reported on 8 benchmarks across 2 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing8 results

  • Sentiment AnalysisonTASD
    F1 (R15)· 2025-02-18
    54.37
    best: 64.74 (MvP (multi-task))
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044
  • Sentiment AnalysisonTASD
    F1 (R16)· 2025-02-18
    66.75
    best: 72.76 (MvP)
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044
  • Sentiment AnalysisonASQP
    F1 (R15)· 2025-02-18
    39.95
    best: 52.21 (MvP (multi-task))
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044
  • Sentiment AnalysisonASQP
    F1 (R16)· 2025-02-18
    46.23
    best: 60.88 (AugABSA)
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044
  • Aspect-Based Sentiment Analysis (ABSA)onTASD
    F1 (R15)· 2025-02-18
    54.37
    best: 64.74 (MvP (multi-task))
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044
  • Aspect-Based Sentiment Analysis (ABSA)onTASD
    F1 (R16)· 2025-02-18
    66.75
    best: 72.76 (MvP)
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044
  • Aspect-Based Sentiment Analysis (ABSA)onASQP
    F1 (R15)· 2025-02-18
    39.95
    best: 52.21 (MvP (multi-task))
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044
  • Aspect-Based Sentiment Analysis (ABSA)onASQP
    F1 (R16)· 2025-02-18
    46.23
    best: 60.88 (AugABSA)
    Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad PredictionarXiv:2502.13044