官术网_书友最值得收藏!

The FeatureExtractor class

This newly added class provides our feature extraction for the given folder of filesOnce extraction is complete, the classification and strings data is written out to the sampledata file:

using System;
using System.IO;

using chapter03_logistic_regression.Common;
using chapter03_logistic_regression.ML.Base;

namespace chapter03_logistic_regression.ML
{
public class FeatureExtractor : BaseML
{
public void Extract(string folderPath)
{
var files = Directory.GetFiles(folderPath);

using (var streamWriter =
new StreamWriter(Path.Combine(AppContext.BaseDirectory, $"../../../Data/{Constants.SAMPLE_DATA}")))
{
foreach (var file in files)
{
var strings = GetStrings(File.ReadAllBytes(file));

streamWriter.WriteLine($"{file.ToLower().Contains("malicious")}\t{strings}");
}
}

Console.WriteLine($"Extracted {files.Length} to {Constants.SAMPLE_DATA}");
}
}
}
主站蜘蛛池模板: 海丰县| 拉萨市| 湟源县| 林芝县| 西城区| 赤峰市| 平阴县| 乐陵市| 石台县| 敦化市| 崇文区| 淄博市| 饶河县| 蕲春县| 临清市| 农安县| 行唐县| 江门市| 肇东市| 夏河县| 唐河县| 义乌市| 陇西县| 始兴县| 阜平县| 株洲市| 神农架林区| 焦作市| 马关县| 南皮县| 景德镇市| 正蓝旗| 临高县| 杂多县| 安徽省| 天门市| 成都市| 剑阁县| 兴仁县| 景洪市| 沧源|