官术网_书友最值得收藏!

Summary

In this chapter, we used several of scikit-learn's methods for building a standard workflow to run and evaluate data mining models. We introduced the Nearest Neighbors algorithm, which is already implemented in scikit-learn as an estimator. Using this class is quite easy; first, we call the fit function on our training data, and second, we use the predict function to predict the class of testing samples.

We then looked at preprocessing by fixing poor feature scaling. This was done using a Transformer object and the MinMaxScaler class. These functions also have a fit method and then a transform, which takes a dataset as an input and returns a transformed dataset as an output.

In the next chapter, we will use these concepts in a larger example, predicting the outcome of sports matches using real-world data.

主站蜘蛛池模板: 瑞安市| 新建县| 延庆县| 太谷县| 盐池县| 祁阳县| 海安县| 杂多县| 万载县| 岳普湖县| 黔南| 凤凰县| 宁都县| 社会| 徐闻县| 修文县| 凉城县| 义马市| 兴化市| 上杭县| 巴南区| 鄯善县| 曲麻莱县| 枣强县| 鞍山市| 砀山县| 新邵县| 出国| 庆安县| 上虞市| 宝山区| 凤山市| 大兴区| 义马市| 大石桥市| 舒城县| 五家渠市| 惠水县| 怀远县| 满洲里市| 集安市|