Vulnerability Java Dataset
Texts
Introduced 2024-03-01
The dataset comes in two versions: one that includes a set of randomly selected unchanged functions from vulnerability-fixing commits, and one that excludes it. It is designed for finetuning large language models to detect vulnerabilities in Java code, and can be used both to train and to evaluate models on automated vulnerability detection tasks.
Source: Finetuning Large Language Models for Vulnerability Detection