Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Qwen2.5-32B + CAPO

Qwen2.5-32B + CAPO

Reported on 7 benchmarks across 5 tasks · 1 paper · 2 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing4 results

Text ClassificationonBala-Copa
Accuracy· 2025-04-22
98.47
SOTA
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
Sentiment AnalysisonSST-5 Fine-grained classification
Accuracy · 2025-04-22
59.07
best: 60.2 (Mistral-Small-24B + CAPO)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
Subjectivity AnalysisonSUBJ
Accuracy· 2025-04-22
91
best: 97.34 (RoBERTa+DualCL)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
Text ClassificationonAG News
Error· 2025-04-22
12.93
best: 4.45 (XLNet)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005

Methodology2 results

ClassificationonBala-Copa
Accuracy· 2025-04-22
98.47
SOTA
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
ClassificationonAG News
Error· 2025-04-22
12.93
best: 4.45 (XLNet)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005

Reasoning1 result

Arithmetic ReasoningonGSM8K
Accuracy· 2025-04-22
60.2
best: 97.72 (Claude 3.5 Sonnet (HPT))
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005