
How it works...

We start by importing our dataset of PE header information from a collection of samples (step 1). This dataset consists of two classes of PE files: malware and benign. We then use plotly to create an interactive 3D graph of the samples (step 1). Next, we prepare our dataset for machine learning: in step 2, we set X as the features and y as the classes of the dataset. Because there are two classes, we aim to cluster the data into two groups that match the sample classification. We utilize the K-means algorithm (step 3), about which you can find more information at https://en.wikipedia.org/wiki/K-means_clustering. K-means is unsupervised, so it is fit on the features alone. The classes in y serve only as a reference for judging the result. Once the clustering algorithm has been fit, we apply it to predict which cluster each sample belongs to (step 4). Observing our results in step 5, we see that the clustering has captured much of the underlying structure of the data, as it was able to fit the data well.
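To make the fit and predict steps more concrete, here is a minimal plain-Python sketch of the K-means iteration (Lloyd's algorithm). It is not the recipe's code, which relies on a library implementation. The helper names `kmeans`, `make_blobs`, and `agreement` are hypothetical, and the synthetic 3D points merely stand in for the PE header features. Because cluster IDs are arbitrary, the comparison against the known classes takes the better of the two possible labelings.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, n_iter=100, seed=0):
    """Lloyd's algorithm: alternate an assignment step and an update step."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initialize from k distinct samples
    labels = None
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:  # assignments stopped changing: converged
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(
                    sum(col) / len(members) for col in zip(*members)
                )
    return labels, centroids


def make_blobs(n_per_class, seed=1):
    """Synthetic stand-in data: two Gaussian blobs in 3D with labels 0 and 1."""
    rng = random.Random(seed)
    centers = [(0.0, 0.0, 0.0), (10.0, 10.0, 10.0)]
    X, y = [], []
    for label, center in enumerate(centers):
        for _ in range(n_per_class):
            X.append(tuple(rng.gauss(m, 1.0) for m in center))
            y.append(label)
    return X, y


def agreement(y_true, y_pred):
    """Fraction of samples whose cluster matches their class (two clusters).

    Cluster IDs carry no meaning, so cluster 0 may correspond to class 1;
    we therefore score both labelings and keep the better one.
    """
    frac = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    return max(frac, 1.0 - frac)


X, y = make_blobs(n_per_class=100)
y_pred, centroids = kmeans(X, k=2)  # y is never passed to the clustering
print("agreement with known classes:", agreement(y, y_pred))
```

Note that `y` is used only in the final comparison. On real PE header data the classes are unlikely to separate as cleanly as these synthetic blobs, so the agreement score should be read as a rough indication of how well the clusters line up with the malware and benign labels.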
