Getting Started¶
The Cyber Range Kyoushi Dataset tool implements the processing layer for the Kyoushi model-driven IDS dataset generation and labeling framework as described by Frank [Frank21] and shown in Figure 1. Dataset definition and generation are handled by the Model, Testbed and Data Collection layers.

The processing layer takes a raw dataset (logs and facts) and model definition as input for processing to create IDS labels for the log events contained in the dataset. This is done in a 5 step processes implemented by the Cyber Range Kyoushi Dataset tool:
- Prepare
- Process
- Pre-Process
- Parse
- Post-Process
- Label
Additionally the tool also implements a CLI command that can be used for sampling a labeled Kyoushi dataset (also see the CLI reference).
References¶
Frank21: Frank, M. Quality improvement of labels for model-driven benchmark data generation for intrusion detection systems. (2021) doi:0.34726/HSS.2021.82646.