INSOMNIA: Towards Concept-Drift Robustness in Network Intrusion Detection

Giuseppina Andresini; Feargus Pendlebury; Fabio Pierazzi; Corrado Loglisci; Annalisa Appice; Lorenzo Cavallaro

doi:10.1145/3474369.3486864

INSOMNIA: Towards Concept-Drift Robustness in Network Intrusion Detection

Giuseppina Andresini, Feargus Pendlebury, Fabio Pierazzi, Corrado Loglisci, Annalisa Appice, Lorenzo Cavallaro

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

112 Downloads (Pure)

Abstract

Despite decades of research in network traffic analysis and great advances in artificial intelligence, network intrusion detection systems based on machine learning (ML) have yet to prove their worth. One core obstacle is the existence of concept drift, an issue for all adversary-facing security systems. Additionally, specific challenges set intrusion detection apart from other ML-based security tasks, such as malware detection. In this work, we offer a new perspective on these challenges. We propose INSOMNIA, a semi-supervised intrusion detector which continuously updates the underlying ML model as network traffic characteristics are affected by concept drift. We use active learning to reduce latency in the model updates, label estimation to reduce labeling overhead, and apply explainable AI to better interpret how the model reacts to the shifting distribution. To evaluate INSOMNIA, we extend TESSERACT—a framework originally proposed for performing sound time-aware evaluations of ML-based malware detectors—to the network intrusion domain and show that accounting for drift is vital for effective detection.

Original language	English
Title of host publication	AISec '21
Subtitle of host publication	Proceedings of the 14th ACM Workshop on Artificial Intelligence and Security
Publisher	ACM
Pages	111-122
Number of pages	12
DOIs	https://doi.org/10.1145/3474369.3486864
Publication status	Published - 15 Nov 2021

Access to Document

10.1145/3474369.3486864

Accepted ManuscriptAccepted author manuscript, 1.84 MB

Cite this

@inproceedings{2066d94d581044c98260fe1b226c5c29,

title = "INSOMNIA: Towards Concept-Drift Robustness in Network Intrusion Detection",

abstract = "Despite decades of research in network traffic analysis and great advances in artificial intelligence, network intrusion detection systems based on machine learning (ML) have yet to prove their worth. One core obstacle is the existence of concept drift, an issue for all adversary-facing security systems. Additionally, specific challenges set intrusion detection apart from other ML-based security tasks, such as malware detection. In this work, we offer a new perspective on these challenges. We propose INSOMNIA, a semi-supervised intrusion detector which continuously updates the underlying ML model as network traffic characteristics are affected by concept drift. We use active learning to reduce latency in the model updates, label estimation to reduce labeling overhead, and apply explainable AI to better interpret how the model reacts to the shifting distribution. To evaluate INSOMNIA, we extend TESSERACT—a framework originally proposed for performing sound time-aware evaluations of ML-based malware detectors—to the network intrusion domain and show that accounting for drift is vital for effective detection.",

author = "Giuseppina Andresini and Feargus Pendlebury and Fabio Pierazzi and Corrado Loglisci and Annalisa Appice and Lorenzo Cavallaro",

year = "2021",

month = nov,

day = "15",

doi = "10.1145/3474369.3486864",

language = "English",

pages = "111--122",

booktitle = "AISec '21",

publisher = "ACM",

}

TY - GEN

T1 - INSOMNIA

T2 - Towards Concept-Drift Robustness in Network Intrusion Detection

AU - Andresini, Giuseppina

AU - Pendlebury, Feargus

AU - Pierazzi, Fabio

AU - Loglisci, Corrado

AU - Appice, Annalisa

AU - Cavallaro, Lorenzo

PY - 2021/11/15

Y1 - 2021/11/15

N2 - Despite decades of research in network traffic analysis and great advances in artificial intelligence, network intrusion detection systems based on machine learning (ML) have yet to prove their worth. One core obstacle is the existence of concept drift, an issue for all adversary-facing security systems. Additionally, specific challenges set intrusion detection apart from other ML-based security tasks, such as malware detection. In this work, we offer a new perspective on these challenges. We propose INSOMNIA, a semi-supervised intrusion detector which continuously updates the underlying ML model as network traffic characteristics are affected by concept drift. We use active learning to reduce latency in the model updates, label estimation to reduce labeling overhead, and apply explainable AI to better interpret how the model reacts to the shifting distribution. To evaluate INSOMNIA, we extend TESSERACT—a framework originally proposed for performing sound time-aware evaluations of ML-based malware detectors—to the network intrusion domain and show that accounting for drift is vital for effective detection.

AB - Despite decades of research in network traffic analysis and great advances in artificial intelligence, network intrusion detection systems based on machine learning (ML) have yet to prove their worth. One core obstacle is the existence of concept drift, an issue for all adversary-facing security systems. Additionally, specific challenges set intrusion detection apart from other ML-based security tasks, such as malware detection. In this work, we offer a new perspective on these challenges. We propose INSOMNIA, a semi-supervised intrusion detector which continuously updates the underlying ML model as network traffic characteristics are affected by concept drift. We use active learning to reduce latency in the model updates, label estimation to reduce labeling overhead, and apply explainable AI to better interpret how the model reacts to the shifting distribution. To evaluate INSOMNIA, we extend TESSERACT—a framework originally proposed for performing sound time-aware evaluations of ML-based malware detectors—to the network intrusion domain and show that accounting for drift is vital for effective detection.

U2 - 10.1145/3474369.3486864

DO - 10.1145/3474369.3486864

M3 - Conference contribution

SP - 111

EP - 122

BT - AISec '21

PB - ACM

ER -