TY - GEN
T1 - A Semi-Supervised Federated Learning Scheme via Knowledge Distillation for Intrusion Detection
AU - Zhao, Ruijie
AU - Yang, Linbo
AU - Wang, Yijun
AU - Xue, Zhi
AU - Gui, Guan
AU - Ohtsuki, Tomoaki
N1 - Funding Information:
ACKNOWLEDGMENT This work was supported by the Foundation Item: Cy-ber Security from the National Key Research and Development Program of Shanghai Jiao Tong University under Grant 2019QY0703. Yijun Wang and Guan Gui are the corresponding authors.
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Federated learning (FL) has become an increasingly popular solution for intrusion detection to avoid data privacy leakage in Internet of Things (IoT) edge devices. However, most of the current FL-based intrusion detection methods still suffer from three limitations: (1) model parameters transmitted in each round may be used to recover private data which leads to security risks, (2) not independent and identically distributed (non-IID) private data seriously adversely affects the training of FL (especially distillation-based FL), and (3) high communication overhead caused by the large model size greatly hinders the actual deployment of the solution. To address these problems, this paper develops an intrusion detection method based on semi-supervised FL scheme via knowledge distillation. First, our proposed method leverages unlabeled data via distillation method to enhance the classifier performance. Second, we build a CNN-based model for extracting deep features of the traffic packets, and take this model as both the classifier network and discriminator network. Third, discriminator is designed to improve the quality of each client's predicted labels, to avoid the failure of distillation training caused by a large number of incorrect predictions under private non-IID data. Moreover, the combination of hard-label strategy and voting mechanism further reduces communication overhead. Experimental results on the real-world traffic dataset show that our proposed method can achieve better classification performance as well as lower communication overhead than state-of-the-art methods.
AB - Federated learning (FL) has become an increasingly popular solution for intrusion detection to avoid data privacy leakage in Internet of Things (IoT) edge devices. However, most of the current FL-based intrusion detection methods still suffer from three limitations: (1) model parameters transmitted in each round may be used to recover private data which leads to security risks, (2) not independent and identically distributed (non-IID) private data seriously adversely affects the training of FL (especially distillation-based FL), and (3) high communication overhead caused by the large model size greatly hinders the actual deployment of the solution. To address these problems, this paper develops an intrusion detection method based on semi-supervised FL scheme via knowledge distillation. First, our proposed method leverages unlabeled data via distillation method to enhance the classifier performance. Second, we build a CNN-based model for extracting deep features of the traffic packets, and take this model as both the classifier network and discriminator network. Third, discriminator is designed to improve the quality of each client's predicted labels, to avoid the failure of distillation training caused by a large number of incorrect predictions under private non-IID data. Moreover, the combination of hard-label strategy and voting mechanism further reduces communication overhead. Experimental results on the real-world traffic dataset show that our proposed method can achieve better classification performance as well as lower communication overhead than state-of-the-art methods.
KW - Intrusion detection
KW - federated learning
KW - knowledge distillation
KW - semi-supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85137267419&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137267419&partnerID=8YFLogxK
U2 - 10.1109/ICC45855.2022.9838256
DO - 10.1109/ICC45855.2022.9838256
M3 - Conference contribution
AN - SCOPUS:85137267419
T3 - IEEE International Conference on Communications
SP - 2688
EP - 2693
BT - ICC 2022 - IEEE International Conference on Communications
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Communications, ICC 2022
Y2 - 16 May 2022 through 20 May 2022
ER -