- Mastering Java for Data Science
- Alexey Grigorev
- 350字
- 2021-07-02 23:44:31
Supervised learning
As we discussed previously, for supervised learning we have some information attached to each data point, the label, and we can train a model to use it and to learn from it. For example, if we want to build a model that tells us whether there is a dog or a cat on a picture, then the picture is the data point and the information whether it is a dog or a cat is the label. Another example is predicting the price of a house--the description of a house is the data point, and the price is the label.
We can group the algorithms of supervised learning into classification and regression algorithms based on the nature of this information.
In classification problems, the labels come from some fixed finite set of classes, such as {cat, dog}, {default, not default}, or {office, food, entertainment, home}. Depending on the number of classes, the classification problem can be binary (only two possible classes) or multi-class (several classes).
Examples of classification algorithms are Naive Bayes, logistic regression, perceptron, Support Vector Machine (SVM), and many others. We will discuss classification algorithms in more detail in the first part of Chapter 4, Supervised Learning - Classification and Regression.
In regression problems, the labels are real numbers. For example, a person can have a salary in the range from $0 per year to several billions per year. Hence, predicting the salary is a regression problem.
Examples of regression algorithms are linear regression, LASSO, Support Vector Regression (SVR), and others. These algorithms will be described in more detail in the second part of Chapter 4, Supervised Learning - Classification and Regression.
Some of the supervised learning methods are universal and can be applied to both classification and regression problems. For example, decision trees, random forest, and other tree-based methods can tackle both types. We will discuss one such algorithm, gradient boosting machines in Chapter 7, Extreme Gradient Boosting.
Neural networks can also deal with both classification and regression problems, and we will talk about them in Chapter 8, Deep Learning with DeepLearning4J.
- 卷積神經網絡的Python實現
- Python醫學數據分析入門
- Scratch 3.0 藝術進階
- 數據庫程序員面試筆試真題庫
- 智能數據時代:企業大數據戰略與實戰
- Spark大數據編程實用教程
- MATLAB Graphics and Data Visualization Cookbook
- 數據科學工程實踐:用戶行為分析與建模、A/B實驗、SQLFlow
- 一本書講透Elasticsearch:原理、進階與工程實踐
- Oracle數據庫管理、開發與實踐
- Web Services Testing with soapUI
- Oracle 11g+ASP.NET數據庫系統開發案例教程
- 算力經濟:從超級計算到云計算
- Unity Game Development Blueprints
- 大數據隱私保護技術與治理機制研究