TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Qwen2.5-32B + CAPO

Qwen2.5-32B + CAPO

Reported on 7 benchmarks across 5 tasks · 1 paper · 2 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing4 results

  • Text ClassificationonBala-Copa
    Accuracy· 2025-04-22
    98.47
    SOTA
    CAPO: Cost-Aware Prompt OptimizationarXiv:2504.16005
  • Sentiment AnalysisonSST-5 Fine-grained classification
    Accuracy · 2025-04-22
    59.07
    best: 60.2 (Mistral-Small-24B + CAPO)
    CAPO: Cost-Aware Prompt OptimizationarXiv:2504.16005
  • Subjectivity AnalysisonSUBJ
    Accuracy· 2025-04-22
    91
    best: 97.34 (RoBERTa+DualCL)
    CAPO: Cost-Aware Prompt OptimizationarXiv:2504.16005
  • Text ClassificationonAG News
    Error· 2025-04-22
    12.93
    best: 4.45 (XLNet)
    CAPO: Cost-Aware Prompt OptimizationarXiv:2504.16005

Methodology2 results

  • ClassificationonBala-Copa
    Accuracy· 2025-04-22
    98.47
    SOTA
    CAPO: Cost-Aware Prompt OptimizationarXiv:2504.16005
  • ClassificationonAG News
    Error· 2025-04-22
    12.93
    best: 4.45 (XLNet)
    CAPO: Cost-Aware Prompt OptimizationarXiv:2504.16005

Reasoning1 result

  • Arithmetic ReasoningonGSM8K
    Accuracy· 2025-04-22
    60.2
    best: 97.72 (Claude 3.5 Sonnet (HPT))
    CAPO: Cost-Aware Prompt OptimizationarXiv:2504.16005