官术网_书友最值得收藏!

Supervised learning

As we discussed previously, for supervised learning we have some information attached to each data point, the label, and we can train a model to use it and to learn from it. For example, if we want to build a model that tells us whether there is a dog or a cat on a picture, then the picture is the data point and the information whether it is a dog or a cat is the label. Another example is predicting the price of a house--the description of a house is the data point, and the price is the label. 

We can group the algorithms of supervised learning into classification and regression algorithms based on the nature of this information.

In classification problems, the labels come from some fixed finite set of classes, such as {cat, dog}, {default, not default}, or {office, food, entertainment, home}. Depending on the number of classes, the classification problem can be binary (only two possible classes) or multi-class (several classes).

Examples of classification algorithms are Naive Bayes, logistic regression, perceptron, Support Vector Machine (SVM), and many others. We will discuss classification algorithms in more detail in the first part of Chapter 4, Supervised Learning - Classification and Regression.

In regression problems, the labels are real numbers. For example, a person can have a salary in the range from $0 per year to several billions per year. Hence, predicting the salary is a regression problem.

Examples of regression algorithms are linear regression, LASSO, Support Vector Regression (SVR), and others. These algorithms will be described in more detail in the second part of Chapter 4, Supervised Learning - Classification and Regression.

Some of the supervised learning methods are universal and can be applied to both classification and regression problems. For example, decision trees, random forest, and other tree-based methods can tackle both types. We will discuss one such algorithm, gradient boosting machines in Chapter 7, Extreme Gradient Boosting.

Neural networks can also deal with both classification and regression problems, and we will talk about them in Chapter 8Deep Learning with DeepLearning4J.

主站蜘蛛池模板: 密山市| 郴州市| 霍林郭勒市| 天峨县| 天全县| 嵩明县| 河曲县| 台北县| 墨玉县| 昭通市| 遂溪县| 洮南市| 乌苏市| 西华县| 枣强县| 佛山市| 天峻县| 石台县| 濮阳市| 嘉定区| 古丈县| 鄯善县| 即墨市| 成都市| 博客| 泰顺县| 彭山县| 高邑县| 宽甸| 竹溪县| 南木林县| 昆山市| 揭西县| 通海县| 阿巴嘎旗| 团风县| 邵阳县| 醴陵市| 湘阴县| 龙海市| 宁海县|