Comparison of Ensemble Learning Methods on the IoT-23 Dataset
Abstract
The Internet of Things (IoT) has provided numerous benefits across various sectors, but it also poses significant challenges in cybersecurity, particularly malware threats. Malware on IoT devices has the potential to damage systems, steal data, and disrupt network performance. Previous research has shown that the Na ̈ıve Bayes algorithm produces a low accuracy of 0.24, increasing slightly to 0.35 when combined with AdaBoost, and reaching 0.99 when combined with XGBoost using the soft voting method. However, there is still room to explore other ensemble learning methods to obtain more stable results. This research focuses on the application of an alternative ensemble learning method, namely stacking, using the IoT-23 dataset with reference to the CRISP-DM framework. The results show that the stacking method can significantly improve malware detection accuracy from 0.35 to 0.72, thus proving superior to soft voting and can be an effective approach in improving malware detection performance in IoT networks.
References
R. Chiwariro and L. Pullagura, “Malware detection and classification using machine learning algorithms,” International Journal of Research in Applied Science and Engineering Technology, vol. 11, no. 8, pp. 1727–1738, 2023.
Sharipuddin, R. S. Putra, M. F. Aulia, S. A. Maulana, and P. A. Jusia, “Android security: Malware detection with convolutional neural network and feature analysis,” Media Journal of General Computer Science, vol. 1, no. 1, pp. 7–13, 2023.
D. Arianyah and I. V. Paputungan, “Jurnal sains, nalar, dan aplikasi teknologi informasi,” Jurnal Sains, Nalar, dan Aplikasi Teknologi Informasi, vol. 3, no. 2, pp. 50–57, 2024.
M. Kumar, K. Bajaj, B. Sharma, and S. Narang, “A comparative performance assessment of optimized multilevel ensemble learning model with existing classifier models,” Big ,vol. 10, no. 5, pp. 371–387, 2021.
K. Anggriani, S. A. Zahra, and A. Susanto, “Enhancing malware detection in iot networks using ensemble learning on iot-23 dataset,” Jurnal Teknik Informatika (Jutif), vol. 6, no. 4, pp. 1985–2000, 2025.
S. Joses, S. Quinevera, R. Mardianto, D. Yulvida, and A. M. Shiddiqi, “Pendekatan metode ensemble learning untuk deteksi serangan ddos menggunakan soft voting classifier,” Jurnal Edukasi dan Penelitian Informatika, vol. 10, no. 1, p. 79, 2024.
M. I. Anugrah, J. Zeniarja, and D. S. Setiawan, “Peningkatan performa model hard voting classifier dengan teknik oversampling adasyn pada penyakit diabetes,” Edumatic: Jurnal Pendidikan Informatika, vol. 8, no. 1, pp. 290–299, 2024.
A. Munna and E. Zuliarso, “Interpretasi model stacking ensemble untuk analisis sentimen ulasan aplikasi pinjaman online menggunakan lime,” AITI, vol. 21, no. 2, pp. 183–196, 2024.
A. K. Putri and H. Suparwito, “Uji algoritma stacking ensemble classifier pada kemampuan adaptasi mahasiswa baru dalam pembelajaran online,” KONSTELASI: Konvergensi Teknologi dan Sistem Informasi, vol. 3, no. 1, pp. 1–12, 2023.
D. Vijayanand and R. K. Singh, “Guardians of iot: Malware analysis of iot devices using machine learning,” TuijinJishu/Journal of Propulsion Technology, vol. 45, no. 1,
pp. 911–924, 2024.
N. Abdalgawad, A. Sajun, Y. Kaddoura, I. A. Zualkernan, and F. Aloul, “Generative deep learning to detect cyberattacks for the iot-23 dataset,” IEEE Access, vol. 10, pp. 6430–6441, 2022.
M. Ihsan, R. K. Niswatin, and D. Swanjaya, “Deteksi ekspresi wajah menggunakan tensorflow,” Joutica, vol. 6, no. 1, p. 428, 2021.
F. Abdusyukur, “Penerapan algoritma support vector machine (svm) untuk klasifikasi pencemaran nama baik di media sosial twitter,” Komputa: Jurnal Ilmiah Komputer dan Informatika, vol. 12, no. 1, pp. 73–82, 2023.
C. V. Oha, F. S. Farouk, P. P. Patel, P. Meka, S. Nekkanti, B. Nayini, S. X. Carvalho, N. Desai, and M. Patel, “Analysis of iot-23 datasets and machine learning models for malicious traffic detection,” 2021.
I. J. Fadillah and C. D. Puspita, “Pemanfaatan metode weighted k-nearest neighbor imputation (weighted knni) untuk mengatasi missing data,” in Seminar Nasional Official Statistics, vol. 2020, pp. 511–518, 2021.
I. O. Muraina, “Ideal dataset splitting ratios in machine learning algorithms: General concerns for data scientists and data analysts,” in 7th International Mardin Artuklu Scientific Research Conference, pp. 496–504, 2022.
N. A. Stoian, “Machine learning for anomaly detection in iot networks: Malware analysis on the iot-23 data set,” 2020.
N. Al Sarah, F. Y. Rifat, M. S. Hossain, and H. S. Narman, “An efficient android malware prediction using ensemble machine learning algorithms,” Procedia Computer Science, vol. 191, pp. 184–191, 2021.










