NetSecData
Texts | Apache 2.0 | Introduced 2024-09-17
The dataset comprises 1641 questions and answers, generated in three separate parts. The first part contains questions and answers that test the model's ability to understand the current state of the environment and the provided rules. The second part contains questions that test the model's ability to generate valid actions, both in syntax (JSON format) and in semantics (validity in the specific state). The third part aims to teach the fine-tuned models how to make correct decisions given a specific environment state.
The dataset was used to fine-tune LLMs to perform tasks in the NetSecGame environment.
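The distinction between syntactic and semantic validity in the second part can be illustrated with a minimal sketch. The description above does not give the actual NetSecGame action schema or state format, so the field names (`action`, `target_host`), the action names (`scan`, `exploit`), and the state layout below are all hypothetical and chosen only for illustration.

```python
import json
from typing import Tuple

# Hypothetical state layout; the real NetSecGame state is not specified here.
EXAMPLE_STATE = {
    "known_hosts": {"192.168.1.2", "192.168.1.3"},
    "allowed_actions": {"scan", "exploit"},
}


def check_action(raw: str, state: dict) -> Tuple[bool, str]:
    """Return (is_valid, reason) for a model-generated action string."""
    # Syntax: the response must parse as a JSON object with the expected keys.
    try:
        action = json.loads(raw)
    except json.JSONDecodeError:
        return False, "syntax: not valid JSON"
    if not isinstance(action, dict):
        return False, "syntax: not a JSON object"
    if not {"action", "target_host"} <= action.keys():
        return False, "syntax: missing required keys"
    if not all(isinstance(action[k], str) for k in ("action", "target_host")):
        return False, "syntax: fields must be strings"

    # Semantics: the action must be applicable in this particular state.
    if action["action"] not in state["allowed_actions"]:
        return False, "semantics: action not allowed"
    if action["target_host"] not in state["known_hosts"]:
        return False, "semantics: unknown target host"
    return True, "ok"
```

Under these assumptions, a well-formed action against a known host passes both checks, while a well-formed action that targets a host absent from the state fails only the semantic check. That second case is the kind of error a purely syntactic validator would miss.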