官术网_书友最值得收藏!

Example of feature engineering procedures – can anyone really predict the weather?

Consider a machine learning pipeline that was built to predict the weather. For the sake of simplicity in our introduction chapter, assume that our algorithm takes in atmospheric data directly from sensors and is set up to predict between one of two values, sun or rain. This pipeline is then, clearly, a classification pipeline that can only spit out one of two answers. We will run this algorithm at the beginning of every day. If the algorithm outputs sun and the day is mostly sunny, the algorithm was correct, likewise, if the algorithm predicts rain and the day is mostly rainy, the algorithm was correct. In any other instance, the algorithm would be considered incorrect. If we run the algorithm every day for a month, we would obtain nearly 30 values of the predicted weather and the actual, observed weather. We can calculate an accuracy of the algorithm. Perhaps the algorithm predicted correctly for 20 out of the 30 days, leading us to label the algorithm with a two out of three or about 67% accuracy. Using this standardized value or accuracy, we could tweak our algorithm and see if the accuracy goes up or down.

Of course, this is an oversimplification, but the idea is that for any machine learning pipeline, it is essentially useless if we cannot evaluate its performance using a set of standard metrics and therefore, feature engineering as applied to the bettering of machine learning, is impossible without said evaluation procedure. Throughout this book, we will revisit this idea of evaluation; however, let’s talk briefly about how, in general, we will approach this idea.

When presented with a topic in feature engineering, it will usually involve transforming our dataset (as per the definition of feature engineering). In order to definitely say whether or not a particular feature engineering procedure has helped our machine learning algorithm, we will follow the steps detailed in the following section.

主站蜘蛛池模板: 克东县| 城固县| 河东区| 北海市| 四子王旗| 河津市| 仪陇县| 囊谦县| 保定市| 衡东县| 额济纳旗| 衡水市| 天津市| 将乐县| 玉溪市| 岑巩县| 新昌县| 繁峙县| 河曲县| 青川县| 扶沟县| 丽水市| 呼图壁县| 绥中县| 新干县| 阜南县| 鞍山市| 岚皋县| 龙川县| 海兴县| 东明县| 宜兰市| 红原县| 台湾省| 四会市| 安国市| 武穴市| 瑞安市| 尼勒克县| 临城县| 建阳市|