官术网_书友最值得收藏!

The sigmoid function

Sigmoid is a smooth and continuously differentiable function. It results in nonlinear output. The sigmoid function is represented here:

Please, look at the observations in the following graph of the sigmoid function. The function ranges from 0 to 1. Observing the curve of the function, we see that the gradient is very high when x values between -3 and 3, but becomes flat beyond that. Thus, we can say that small changes in x near these points will bring large changes in the value of the sigmoid function. Therefore, the function goals in pushing the values of the sigmoid function towards the extremes.

Therefore, it's being used in classification problems:

Looking at the gradient of the following sigmoid function, we observe a smooth curve dependent on x. Since the gradient curve is continuous, it's easy to backpropagate the error and update the parameters, that is, and :

Sigmoids are widely used but its disadvantage is that the function goes flat beyond +3 and -3. Thus, whenever the function falls in that region, the gradients tends to approach zero and the learning of our neural network comes to a halt.

Since the sigmoid function outputs values from 0 to 1, that is, all positive, it's non symmetrical around the origin and all output signals are positive, that is, of the same sign. To tackle this, the sigmoid function has been scaled to the tanh function, which we will study next. Moreover, since the gradient results in a very small value, it's susceptible to the vanishing gradient problem (which we will discuss later in this chapter).

主站蜘蛛池模板: 新兴县| 河源市| 紫金县| 耒阳市| 横峰县| 扎赉特旗| 大连市| 大余县| 杭锦后旗| 时尚| 邵阳县| 泗阳县| 石泉县| 长乐市| 西青区| 黄大仙区| 夹江县| 和田市| 海兴县| 青川县| 阜新市| 龙门县| 临泉县| 泸西县| 乐都县| 常宁市| 色达县| 综艺| 长春市| 商水县| 英德市| 新野县| 遂宁市| 西华县| 晋江市| 枞阳县| 璧山县| 瑞金市| 绩溪县| 娄烦县| 乌恰县|