Code Llama - Python 70B (3-shot)

Reported on 1 benchmark across 1 task · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing · 1 result

Code Generation on MBPP
Accuracy · uses extra data · 2023-08-24
65.5
best: 96.6 (EG-CFG (DeepSeek-V3-0324))
Code Llama: Open Foundation Models for Code · arXiv:2308.12950