
Summary

In this chapter, we used several of scikit-learn's methods to build a standard workflow for running and evaluating data mining models. We introduced the Nearest Neighbors algorithm, which scikit-learn already implements as an estimator. Using this class is straightforward: first, we call the fit method on our training data, and second, we use the predict method to predict the class of testing samples.
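As a reminder of the fit-then-predict pattern, here is a minimal sketch. The tiny dataset is made up for illustration and is not the data used in the chapter; the variable names and the choice of n_neighbors=3 are assumptions as well.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Made-up training data: two well-separated clusters (illustrative only)
X_train = np.array([[0.0, 0.1], [0.2, 0.0], [0.1, 0.3],
                    [5.0, 5.1], [5.2, 4.9], [4.8, 5.3]])
y_train = np.array([0, 0, 0, 1, 1, 1])

# Made-up testing samples, one near each cluster
X_test = np.array([[0.1, 0.1], [5.1, 5.0]])

# Step 1: fit the estimator on the training data
estimator = KNeighborsClassifier(n_neighbors=3)
estimator.fit(X_train, y_train)

# Step 2: predict the class of each testing sample
y_pred = estimator.predict(X_test)
print(y_pred)
```

Each testing sample is assigned the majority class among its three closest training samples, so the first sample falls in class 0 and the second in class 1.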

We then looked at preprocessing as a way to fix poorly scaled features. This was done with the MinMaxScaler class, which is a Transformer object. Transformers also have a fit method, followed by a transform method that takes a dataset as input and returns a transformed dataset as output.
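The same two-step pattern for a transformer might look like the sketch below. The small array is invented for illustration, with one feature on a much larger scale than the other; it is not the chapter's dataset.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# Made-up data: the second feature is on a far larger scale than the first
X = np.array([[1.0, 1000.0],
              [2.0, 3000.0],
              [3.0, 5000.0]])

# Step 1: fit learns the minimum and maximum of each feature
scaler = MinMaxScaler()
scaler.fit(X)

# Step 2: transform rescales each feature to the default range [0, 1]
X_scaled = scaler.transform(X)
print(X_scaled)
```

After scaling, both features span the same range, so neither dominates a distance-based method such as Nearest Neighbors purely because of its units.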

In the next chapter, we will use these concepts in a larger example, predicting the outcome of sports matches using real-world data.
