官术网_书友最值得收藏!

The cost function of neural networks

We will now explore how can we evaluate the performance of a neural network by using the cost function. We will use it to measure how far we are from the expected value. We are going to use the following notation and variables:

Variable Y to represent the true value

Variable a to represent the neuron prediction

In terms of weight and biases, the formula is as follows:

We pass z, which is the input (X) times the weight (X) added to the bias (b), into the activation function of .

There are many types of cost functions, but we are just going to discuss two of them:

The quadratic cost function

The cross-entropy function

The first cost function we are going to discuss is the quadratic cost function, which is represented with the following formula:

In the preceding formula, we can see that when the error is high, which means the actual value (Y) is less than the predictive value (a), then the value of the cost function will be negative and we cannot use a negative value as cost. So, we are going to square the result, then the value of the cost function will be a positive value. But unfortunately, when we use the quadratic cost function, the way the formula works actually reduces the learning rate of the network.

Instead, we are going to use the cross-entropy function, which can be defined as follows:

This cost function allows faster learning because the larger the difference between y and a, the faster our neurons' learning rate. This means that if the network has a large difference between the predicted value and the actual value athe beginning of the model training process, then we can essentially move toward using a cost function because the larger the difference, the faster the neurons are going to learn.

There are two key components of how neural networks learn from features. First, there are neurons and their activation functions and cost functions, but we are still missing a key step: the actual learning process. So, we need to figure out how we can use the neurons and their measurement of error (the cost function) to correct our prediction or make the network learn. Up until now, we have tried to understand neurons and perceptrons, and then linked them to get a neural net. We also understand that cost functions are essentially measurements of errors. Now, we are going to fix the errors between the actual and predicted values using gradient descent and backpropagation.

主站蜘蛛池模板: 楚雄市| 富阳市| 合山市| 麻江县| 伊春市| 襄樊市| 平远县| 西城区| 台安县| 龙海市| 灵山县| 扎赉特旗| 大名县| 喜德县| 曲松县| 营口市| 华蓥市| 砚山县| 青河县| 冕宁县| 香格里拉县| 鄄城县| 兴安盟| 绥中县| 南郑县| 桂平市| 密山市| 荃湾区| 峨边| 镇巴县| 古蔺县| 淮安市| 沙坪坝区| 恩施市| 青河县| 菏泽市| 温州市| 大姚县| 宁海县| 剑川县| 侯马市|