What question did this study set out to answer?

The research aims to develop a unified framework for evaluating intrusion detection systems, enhancing performance and transparency.

April 22, 2026Open Access

A cross-dataset harmonized intrusion detection framework with statistically validated multi-model learning

Key Points

The research aims to develop a unified framework for evaluating intrusion detection systems, enhancing performance and transparency.
A preprocessing pipeline is created to harmonize features from legacy and contemporary datasets.
Various learning models, including supervised and unsupervised, are assessed through cross-validation and statistical tests.
A novel cryptographic logging mechanism is introduced for result traceability.
The Random Forest model achieves 98.0% accuracy and 97.0% F1-score on the harmonized dataset.
Feature harmonization is identified as crucial for improving model performance through ablation analysis.
The proposed logging mechanism enhances traceability but is less effective than a blockchain approach.

Abstract

Intrusion Detection Systems (IDS) are considered critical security tools in ensuring network infrastructure security. However, recent studies on machine learning-based IDS systems are often constrained by their heavy dependence on a single dataset, lack of reproducibility, and lack of transparency in evaluating their performance. In addressing these challenges, a unified and transparent framework for evaluating IDS systems is proposed, which focuses on integrating feature harmonization, multi-model benchmarking, and statistical validation. In achieving this objective, a preprocessing pipeline is designed to harmonize features of both legacy and contemporary network intrusion datasets, i.e., NSL-KDD and CICIDS2017, respectively. This framework will assess various learning models, including supervised, unsupervised, deep learning, and ensemble-based models, through cross-validation and statistical tests such as Wilcoxon signed-rank, McNemar’s, and DeLong tests. Experimental results demonstrate that the Random Forest model performs best in terms of performance metrics, i.e., 98.0% accuracy and 97.0% F1-score on the harmonized data set. Moreover, feature harmonization is found to be the most important factor in improving performance using ablation analysis. Besides, a novel approach of using a cryptographic logging mechanism using SHA-256 hash chaining is proposed for tamper-evident traceability and reproducibility of results in experiments, though it is not as effective as using a blockchain-based approach. Although effective in its application, it is based on manual feature alignment and hence might not be effective in highly heterogeneous data sets.This work provides a unified, reproducible, and statistically grounded framework for evaluating IDS systems, focusing on generalization and transparency in cybersecurity research.

A cross-dataset harmonized intrusion detection framework with statistically validated multi-model learning

Key Points

Abstract

Cite This Study