Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Llama-3.3-70B + CAPO

Reported on 7 benchmarks across 5 tasks · 1 paper · 1 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing (4 results)

  • Sentiment Analysis on SST-5 Fine-grained classification
    Accuracy · 2025-04-22
    62.27
    SOTA
    CAPO: Cost-Aware Prompt Optimization (arXiv:2504.16005)
  • Subjectivity Analysis on SUBJ
    Accuracy · 2025-04-22
    91.6
    best: 97.34 (RoBERTa+DualCL)
    CAPO: Cost-Aware Prompt Optimization (arXiv:2504.16005)
  • Text Classification on Bala-Copa
    Accuracy · 2025-04-22
    98.27
    best: 98.47 (Qwen2.5-32B + CAPO)
    CAPO: Cost-Aware Prompt Optimization (arXiv:2504.16005)
  • Text Classification on AG News
    Error · 2025-04-22
    11.2
    best: 4.45 (XLNet)
    CAPO: Cost-Aware Prompt Optimization (arXiv:2504.16005)

Methodology (2 results)

  • Classification on Bala-Copa
    Accuracy · 2025-04-22
    98.27
    best: 98.47 (Qwen2.5-32B + CAPO)
    CAPO: Cost-Aware Prompt Optimization (arXiv:2504.16005)
  • Classification on AG News
    Error · 2025-04-22
    11.2
    best: 4.45 (XLNet)
    CAPO: Cost-Aware Prompt Optimization (arXiv:2504.16005)

Reasoning (1 result)

  • Arithmetic Reasoning on GSM8K
    Accuracy · 2025-04-22
    73.73
    best: 97.72 (Claude 3.5 Sonnet (HPT))
    CAPO: Cost-Aware Prompt Optimization (arXiv:2504.16005)