官术网_书友最值得收藏!

Sigmoid

As you have already seen, the sigmoid function is a particular case of the logistic function, and it gives us something similar to a step function; therefore, it's useful for binary classifications, indicating a probability as a result. The function is differentiable; therefore, we can run gradient descent for every point. It's also monotonic, which means that it always increases or decreases, but its derivative does not; therefore, it will have a minima. It forces all output values to be between 0 and 1. Because of this, even very high values asymptotically tend to one and very low to 0. One problem that this creates is that the derivative in those points is approximately 0; therefore, the gradient descent process will not find a local minima for very high or very low values, as shown in the following diagram:

主站蜘蛛池模板: 惠水县| 湘乡市| 沾化县| 朝阳市| 连云港市| 泰顺县| 浦江县| 全椒县| 开鲁县| 河津市| 明溪县| 无为县| 琼海市| 登封市| 社会| 麦盖提县| 荆州市| 丹东市| 拜城县| 金湖县| 黄石市| 甘泉县| 上饶市| 离岛区| 武义县| 洛宁县| 赤水市| 长沙市| 精河县| 寿阳县| 隆回县| 蒙自县| 仪征市| 阳信县| 滨州市| 临武县| 绥江县| 鸡泽县| 新巴尔虎左旗| 云南省| 宜君县|