
Supervised learning algorithms

Supervised algorithms rely on human knowledge, in the form of labeled examples, to complete their tasks. Let's say we have a dataset related to loan repayment that contains several demographic indicators, as well as a Paid column recording whether each loan was paid back.

The Paid column, which tells us whether a loan was paid back, is called the target; it is what we would like to predict. The data that describes each applicant's background is known as the features of the dataset. In supervised learning, algorithms learn to predict the target from the features. In other words, they learn which indicators give a high probability that an applicant will pay back a loan. Mathematically, this process looks as follows:

$y = f(X) + \epsilon$

Here, we are saying that our label $y$ is a function of the input features $X$, plus some amount of error $\epsilon$ that is caused naturally by the dataset. We know that a certain set of features will likely produce a certain outcome. In supervised learning, we set up an algorithm to learn the function that maps a set of features to the correct outcome.
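
The idea that a label is a function of the features plus some error can be made concrete with a small simulation. The following is a minimal sketch, not part of the original example: the "true" function, its weights, and the noise level are all assumptions chosen for illustration, and ordinary least squares stands in for whatever learning algorithm one might actually use.

```python
import numpy as np

rng = np.random.default_rng(seed=0)

# Assumed "true" function f: a linear map with illustrative weights.
true_weights = np.array([2.0, -1.0])

X = rng.normal(size=(200, 2))                # input features
epsilon = rng.normal(scale=0.1, size=200)    # error inherent in the data
y = X @ true_weights + epsilon               # label = f(features) + error

# "Learning" means estimating f from (X, y) alone.
# Here we use ordinary least squares as a simple stand-in learner.
learned_weights, *_ = np.linalg.lstsq(X, y, rcond=None)

print("true weights:   ", true_weights)
print("learned weights:", learned_weights)
```

The learned weights should land close to the true ones but not exactly on them, because the error term can never be fully explained by the features.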

To illustrate how supervised learning works, we are going to use a well-known toy dataset in the machine learning field, the Iris dataset. It contains four features: Sepal Length, Sepal Width, Petal Length, and Petal Width. In this dataset, our target variable (sometimes called a label) is Name. The dataset is available in the GitHub repository that corresponds with this chapter:

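import pandas as pd

# Load the Iris dataset from the chapter's repository into a DataFrame
data = pd.read_csv("iris.csv")
# Display the first five rows (shown automatically in a notebook)
data.head()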

The preceding code generates the following output:

Now that we have our data ready to go, let's jump into some supervised learning!
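
Before training anything, the features and the target are typically separated. The following is a minimal sketch under stated assumptions: `split_features_target` is a hypothetical helper, the column names (`SepalLength`, `SepalWidth`, `PetalLength`, `PetalWidth`, `Name`) are assumed and may differ in the repository's file, and the three-row frame is only a stand-in so the example runs without `iris.csv`.

```python
import pandas as pd


def split_features_target(frame, target_column):
    """Split a DataFrame into (features, target) by column name."""
    features = frame.drop(columns=[target_column])
    target = frame[target_column]
    return features, target


# Stand-in for a few rows of iris.csv; column names are assumptions.
sample = pd.DataFrame({
    "SepalLength": [5.1, 4.9, 7.0],
    "SepalWidth": [3.5, 3.0, 3.2],
    "PetalLength": [1.4, 1.4, 4.7],
    "PetalWidth": [0.2, 0.2, 1.4],
    "Name": ["Iris-setosa", "Iris-setosa", "Iris-versicolor"],
})

X, y = split_features_target(sample, "Name")
print(X.shape)       # rows and feature columns
print(y.unique())    # distinct class labels in the target
```

With the real file, the same call would be `split_features_target(data, "Name")`, provided the target column is named as assumed here.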
