官术网_书友最值得收藏!

Evaluating the fitness of the model with a cost function

Regression lines produced by several sets of parameter values are plotted in the following figure. How can we assess which parameters produced the best-fitting regression line?

A cost function, also called a loss function, is used to define and measure the error of a model. The differences between the prices predicted by the model and the observed prices of the pizzas in the training set are called residuals, or training errors. Later, we will evaluate the model on a separate set of test data. The differences between the predicted and observed values in the test data are called prediction errors, or test errors. The residuals for our model are indicated by vertical lines between the points for the training instances and the regression hyperplane in the following plot:

We can produce the best pizza-price predictor by minimizing the sum of the residuals. That is, our model fits if the values it predicts for the response variable are close to the observed values for all of the training examples. This measure of the model's fitness is called the residual sum of squares (RSS) cost function. Formally, this function assesses the fitness of a model by summing the squared residuals for all of our training examples. The RSS is calculated with the formula in the following equation, where yi is the observed value and f(xi) is the predicted value:

Let's compute the RSS for our model by adding the following two lines to the previous script:

print('Residual sum of squares: %.2f' % np.mean((model.predict(X)
- y) ** 2))
Residual sum of squares: 1.75

Now that we have a cost function, we can find the values of the model's parameters that minimize it.

主站蜘蛛池模板: 汉源县| 正蓝旗| 民权县| 泾阳县| 马山县| 黔南| 汶川县| 闸北区| 贵港市| 巴彦县| 大荔县| 长岭县| 英德市| 德令哈市| 禹州市| 新蔡县| 图木舒克市| 会东县| 靖边县| 琼结县| 张家川| 响水县| 松溪县| 姜堰市| 定南县| 庆云县| 平阴县| 荣成市| 大庆市| 阿拉善左旗| 榆中县| 遂溪县| 夹江县| 平远县| 彭泽县| 河南省| 樟树市| 孟村| 金塔县| 招远市| 汉沽区|