Summary

In this chapter, we defined machine learning as the design of programs that can improve their performance at a task by learning from experience. We discussed the spectrum of supervision in experience. At one end is supervised learning, in which a program learns from inputs that are labeled with their corresponding outputs. Unsupervised learning, in which the program must discover structure in only unlabeled inputs, is at the opposite end of the spectrum. Semi-supervised approaches make use of both labeled and unlabeled training data.

Next, we discussed common types of machine learning tasks and reviewed examples of each. In classification tasks, the program must predict the value of a discrete response variable from the observed explanatory variables. In regression tasks, the program must predict the value of a continuous response variable from the explanatory variables. Unsupervised learning tasks include clustering, in which observations are organized into groups according to some similarity measure, and dimensionality reduction, which reduces a set of explanatory variables to a smaller set of synthetic features that retain as much information as possible. We also reviewed the bias-variance trade-off and discussed common performance measures for different machine learning tasks.
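To make the distinction between classification and regression concrete, the following is a minimal sketch that uses only the Python standard library. The helper names (`knn_classify`, `fit_simple_linear_regression`) and the toy data are invented for illustration; they are not taken from this chapter, and they are not scikit-learn's API. The classifier is a simple nearest-neighbors majority vote and the regressor is an ordinary least squares line, chosen here only because both are short.

```python
from collections import Counter
from statistics import mean


def knn_classify(train_X, train_y, x, k=3):
    """Classification: predict a discrete label by majority vote
    among the k training points closest to x (squared Euclidean distance)."""
    neighbors = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in neighbors[:k])
    return votes.most_common(1)[0][0]


def fit_simple_linear_regression(xs, ys):
    """Regression: fit y = intercept + slope * x by ordinary least squares."""
    x_bar, y_bar = mean(xs), mean(ys)
    slope = (
        sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
        / sum((x - x_bar) ** 2 for x in xs)
    )
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Hypothetical toy data: two explanatory variables and a discrete response.
train_X = [(150, 50), (155, 55), (160, 58), (175, 80), (180, 85), (185, 90)]
train_y = ["small", "small", "small", "large", "large", "large"]
predicted_label = knn_classify(train_X, train_y, (178, 82))

# Hypothetical toy data: one explanatory variable and a continuous response.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
intercept, slope = fit_simple_linear_regression(xs, ys)
predicted_value = intercept + slope * 5

print(predicted_label)   # a discrete class label
print(predicted_value)   # a continuous value
```

The first prediction is one of a fixed set of labels, while the second is a real number. That difference in the response variable is what separates the two kinds of task.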

We then discussed the history, goals, and advantages of scikit-learn. Finally, we prepared our development environment by installing scikit-learn and other libraries that are commonly used in conjunction with it. In the next chapter, we will discuss a simple model for regression tasks and build our first machine learning model with scikit-learn.
