RMCBench

TextsarXiv perpetual, non-exclusive licenseIntroduced 2024-09-23

The first benchmark comprising 473 prompts designed to assess the ability of LLMs to resist malicious code generation.