官术网_书友最值得收藏!

Supervised learning algorithms

Supervised algorithms rely on human knowledge to complete their tasks. Let's say we have a dataset related to loan repayment that contains several demographic indicators, as well as whether a loan was paid back or not:

The Paid column, which tells us if a loan was paid back or not, is called the target - it's what we would like to predict. The data that contains information about the applicants background is known as the features of the datasets. In supervised learning, algorithms learn to predict the target based on the features, or in other words, what indicators give a high probability that an applicant will pay back a loan or not? Mathematically, this process looks as follows:

Here, we are saying that our label  is a function of the input features , plus some amount of error  that it caused naturally by the dataset. We know that a certain set of features will likely produce a certain outcome. In supervised learning, we set up an algorithm to learn what function will produce the correct mapping of a set of features to an outcome. 

To illustrate how supervised learning works, we are going to utilize a famous example toy dataset in the machine learning field, the Iris Dataset. It shows four features: Sepal Length, Sepal Width, Petal Length, and Petal Width. In this dataset, our target variable (sometimes called a label) is Name. The dataset is available in the GitHub repository that corresponds with this chapter:

import pandas as pd
data = pd.read_csv("iris.csv")
data.head()

The preceding code generates the following output:

Now that we have our data ready to go, let's jump into some supervised learning!

主站蜘蛛池模板: 武安市| 安溪县| 民和| 故城县| 谢通门县| 林州市| 德阳市| 商都县| 富锦市| 五莲县| 广宁县| 华亭县| 鄂尔多斯市| 鹿邑县| 阿勒泰市| 安吉县| 榆社县| 迁西县| 横山县| 隆子县| 榕江县| 辽中县| 延庆县| 庄浪县| 即墨市| 丹阳市| 华亭县| 灌云县| 浑源县| 铜川市| 抚远县| 天门市| 四平市| 定西市| 和龙市| 眉山市| 巫溪县| 苍梧县| 滦平县| 滨州市| 平利县|