RealVul

RealVul-Vulnerability Dataset following realistic settings

TextsIntroduced 2024-07-03

This is a C++ vulnerability detection dataset following realistic settings. For details, please check our study Revisiting the Performance of Deep Learning-Based Vulnerability Detection on Realistic Datasets (Partha et al., 2024)

The column names are self-describing. The most important two columns are,

Target: int: vulnerable to not. Code: str: the code segment. For details, please check our huggingface repository.