官术网_书友最值得收藏!

Overcoming the limitations of deep learning

These two possible problems can be overcome by:

  • Minimizing the use of the sigmoid and tanh activation functions
  • Using a momentum-based stochastic gradient descent
  • Proper initialization of weights and biases, such as xavier initialization
  • Regularization (add regularization loss along with data loss and minimize that)
For more detail, along with mathematical representations of the vanishing and exploding gradient, you can read this article: Intelligent Signals : Unstable Deep Learning. Why and How to solve them ?
主站蜘蛛池模板: 屏山县| 湾仔区| 登封市| 诸城市| 莱西市| 紫云| 花垣县| 泽普县| 谷城县| 四川省| 册亨县| 南昌县| 从江县| 阿拉善左旗| 彰武县| 年辖:市辖区| 昌宁县| 屯昌县| 镇江市| 石台县| 东平县| 汕尾市| 宜章县| 怀集县| 阜阳市| 华宁县| 武宁县| 普陀区| 昌吉市| 渝中区| 榆树市| 志丹县| 黔江区| 高陵县| 沛县| 阳东县| 璧山县| 乌什县| 台湾省| 油尖旺区| 泰顺县|