
Summary

In this chapter, we used several of scikit-learn's methods to build a standard workflow for running and evaluating data mining models. We introduced the nearest neighbors algorithm, which scikit-learn already implements as an estimator. Using this class is straightforward: first, we call its fit method on the training data; second, we call its predict method to predict the class of each testing sample.
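As a reminder of the fit-then-predict pattern, here is a minimal sketch using scikit-learn's KNeighborsClassifier. The toy arrays and the choice of n_neighbors=3 are assumptions made for illustration only; they are not the dataset or settings used in the chapter.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical toy data: two well-separated clusters in two dimensions.
X_train = np.array([[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
                    [5.0, 5.0], [5.1, 4.9], [4.9, 5.2]])
y_train = np.array([0, 0, 0, 1, 1, 1])
X_test = np.array([[0.05, 0.1], [5.05, 5.0]])

# Step 1: fit the estimator on the training data.
estimator = KNeighborsClassifier(n_neighbors=3)
estimator.fit(X_train, y_train)

# Step 2: predict the class of each testing sample.
y_pred = estimator.predict(X_test)
print(y_pred)
```

Each test point sits inside one of the two clusters, so its three nearest neighbors all share a single class.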

We then looked at preprocessing as a way to fix poorly scaled features. This was done using a transformer object, the MinMaxScaler class. Like estimators, transformers have a fit method, followed by a transform method that takes a dataset as input and returns a transformed dataset as output.
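The transformer interface can be sketched in the same way. The small array below is a made-up example, chosen so that the two features sit on very different scales; with its default settings, MinMaxScaler rescales each feature to the range 0 to 1.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# Hypothetical data: the second feature is far larger in magnitude than the first.
X = np.array([[1.0, 1000.0],
              [2.0, 3000.0],
              [3.0, 5000.0]])

# fit learns the per-feature minimum and maximum from the data.
scaler = MinMaxScaler()
scaler.fit(X)

# transform takes a dataset and returns a rescaled copy of it.
X_scaled = scaler.transform(X)
print(X_scaled)
```

After scaling, both columns span the same range, so neither dominates a distance-based algorithm such as nearest neighbors purely because of its units.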

In the next chapter, we will use these concepts in a larger example, predicting the outcome of sports matches using real-world data.
