Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Llama-3.3-70B + CAPO

Llama-3.3-70B + CAPO

Reported on 7 benchmarks across 5 tasks · 1 paper · 1 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing4 results

Sentiment AnalysisonSST-5 Fine-grained classification
Accuracy· 2025-04-22
62.27
SOTA
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
Subjectivity AnalysisonSUBJ
Accuracy· 2025-04-22
91.6
best: 97.34 (RoBERTa+DualCL)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
Text ClassificationonBala-Copa
Accuracy· 2025-04-22
98.27
best: 98.47 (Qwen2.5-32B + CAPO)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
Text ClassificationonAG News
Error· 2025-04-22
11.2
best: 4.45 (XLNet)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005

Methodology2 results

ClassificationonBala-Copa
Accuracy· 2025-04-22
98.27
best: 98.47 (Qwen2.5-32B + CAPO)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005
ClassificationonAG News
Error· 2025-04-22
11.2
best: 4.45 (XLNet)
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005

Reasoning1 result

Arithmetic ReasoningonGSM8K
Accuracy· 2025-04-22
73.73
best: 97.72 (Claude 3.5 Sonnet (HPT))
CAPO: Cost-Aware Prompt Optimization arXiv:2504.16005