Pakistan Science Abstracts
Article details & metrics
No Detail Found!!
Tree-Based Learning Models for Botnet Malware Classification in Real World Sub-Sample Dataset
Author(s):
1. Akinyemi Moruff Oyelakin: Crescent University, Abeokuta, Nigeria
2. Rasheed Gbenga Jimoh: University of Ilorin, Ilorin, Nigeria
Abstract:
The use of machine learning techniques for botnet detection has been an active area of research in security field for some years now. Some of the past machine learning-based botnet detection studies used datasets that were generated synthetically. The release of a large and real-life botnet dataset, named CTU-13, allowed researchers to build machine learning-based models from real-world data. In fact, the real-life traces in the dataset makes it more promising for being used for botnet identification studies. The current study proposed the use of a single tree-based learning algorithm in the classification of botnet evidence from sub-sampled portion of three captures in CTU-13 dataset. Random sub-sampling was used to arrive at three different datasets that was used in the study. The first step in the methodology involved experimental analyses on three captures out of the thirteen in the whole dataset. The analyses revealed the basic characteristics of the datasets which further guided the study. The missing values and categorical data types in the dataset were handled through mixed imputation and feature encoding, respectively. The big data nature of the dataset was handled through random sub-sampling technique with a view to building a botnet detection model that is less computationally intensive. The random sub-sampling technique was used without changing the data distributions in the dataset. The botnet detection models were built by using decision-tree algorithm from the three sub-sampled dataset captures. The performances of the models were evaluated by using accuracy, precision, recall, and f1-score, respectively. In all, the model built with scenario5 capture slightly performed better than the ones built using scenario 6 and scenario 7 captures, respectively.
Page(s): 1-13
Published: Journal: Innovative Computing Review (ICR), Volume: 3, Issue: 2, Year: 2023
Keywords:
Malware Detection , botnet malware , bot communication , tree learning
References:
References are not available for this document.
Citations
Citations are not available for this document.
0

Citations

0

Downloads

7

Views