官术网_书友最值得收藏!

How it works...

We start by importing our dataset of PE header information from a collection of samples (step 1). This dataset consists of two classes of PE files: malware and benign. We then use plotly to create a nice-looking interactive 3D graph (step 1). We proceed to prepare our dataset for machine learning. Specifically, in step 2, we set X as the features and y as the classes of the dataset. Based on the fact that there are two classes, we aim to cluster the data into two groups that will match the sample classification. We utilize the K-means algorithm (step 3), about which you can find more information at: https://en.wikipedia.org/wiki/K-means_clustering. With a thoroughly trained clustering algorithm, we are ready to predict on the testing set. We apply our clustering algorithm to predict to which cluster each of the samples should belong (step 4). Observing our results in step 5, we see that clustering has captured a lot of the underlying information, as it was able to fit the data well.

主站蜘蛛池模板: 孟村| 皋兰县| 上饶县| 苏尼特右旗| 小金县| 礼泉县| 平顶山市| 桐庐县| 安义县| 义乌市| 三明市| 五莲县| 灵丘县| 沙湾县| 西充县| 习水县| 镇安县| 桃园县| 隆林| 松桃| 太原市| 顺义区| 栾川县| 姜堰市| 准格尔旗| 永和县| 剑河县| 鸡东县| 神木县| 射洪县| 长兴县| 定西市| 平阴县| 运城市| 罗江县| 南召县| 和静县| 景德镇市| 西宁市| 辽中县| 张家界市|