TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Code Generation/WebApp1K-React

Code Generation on WebApp1K-React

Metric: pass@1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕pass@1▼Extra DataPaperDate↕Code
1o1-preview0.952NoA Case Study of Web App Coding with OpenAI Reaso...2024-09-19Code
2o1-mini0.939NoA Case Study of Web App Coding with OpenAI Reaso...2024-09-19Code
3gpt-4o-2024-08-060.885NoInsights from Benchmarking Frontier Language Mod...2024-09-08Code
4claude-3.5-sonnet0.8808NoInsights from Benchmarking Frontier Language Mod...2024-09-08Code
5deepseek-v2.50.834NoA Case Study of Web App Coding with OpenAI Reaso...2024-09-19Code
6mistral-large-20.7804NoInsights from Benchmarking Frontier Language Mod...2024-09-08Code
7deepseek-coder-v2-instruct0.7002NoInsights from Benchmarking Frontier Language Mod...2024-09-08Code
8llama-v3p1-405b-instruct0.302NoInsights from Benchmarking Frontier Language Mod...2024-09-08Code