  • Neural Networks with R
  • Giuseppe Ciaburro, Balaji Venkateswaran

Which activation functions to use?

Since neural networks need to support nonlinearity and greater complexity, the activation function has to be robust enough to satisfy the following:

  • It should be differentiable; we will see why differentiation is needed in backpropagation. It should not cause gradients to vanish.
  • It should be simple and fast in processing.
  • It should be zero centered, so that its outputs do not all share the same sign.

The sigmoid is the most widely used activation function, but it suffers from the following drawbacks:

  • Since it uses the logistic model, the computations are time consuming and complex
  • It causes gradients to vanish, so that at some point no signal passes through the neurons (see the short sketch after this list)
  • It converges slowly
  • It is not zero centered
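
To make the vanishing gradient point concrete, here is a minimal sketch in base R (not code from the book): the derivative of the sigmoid peaks at 0.25 at x = 0 and shrinks toward zero for large |x|, and backpropagation multiplies many such small factors together.

    # Sigmoid and its derivative: sigma'(x) = sigma(x) * (1 - sigma(x))
    sigmoid      <- function(x) 1 / (1 + exp(-x))
    sigmoid_grad <- function(x) sigmoid(x) * (1 - sigmoid(x))

    x <- c(-10, -5, -1, 0, 1, 5, 10)
    # The derivative never exceeds 0.25 and is nearly zero for |x| >= 10,
    # which is where gradients effectively stop flowing through the neuron
    round(sigmoid_grad(x), 5)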

These drawbacks are solved by ReLU. ReLU is simple and fast to compute. It does not suffer from the vanishing gradient problem and has shown large improvements compared to the sigmoid and tanh functions. ReLU is the most preferred activation function for neural networks and DL problems.
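
By contrast, the ReLU gradient is exactly 1 for any positive input, so it does not shrink as the input grows. The following sketch (again plain base R, not code from the book) shows the function and its subgradient side by side:

    # ReLU and its (sub)gradient: 0 for negative inputs, 1 for positive inputs
    relu      <- function(x) pmax(0, x)
    relu_grad <- function(x) as.numeric(x > 0)

    x <- seq(-3, 3, by = 1)
    data.frame(x = x, relu = relu(x), grad = relu_grad(x))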

ReLU is used for the hidden layers, while the output layer can use a softmax function for logistic problems and a linear function for regression problems.
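
As an illustration of this layout, here is a toy forward pass in base R; the weight matrices W1 and W2 are random placeholders rather than trained values, and serve only to show where ReLU, softmax, and a linear output fit:

    # Toy forward pass: ReLU in the hidden layer, then either a softmax output
    # (classification) or a linear/identity output (regression)
    softmax <- function(z) { e <- exp(z - max(z)); e / sum(e) }

    set.seed(1)
    x  <- c(0.2, -1.3, 0.7)            # one input vector with 3 features
    W1 <- matrix(rnorm(3 * 4), 3, 4)   # 3 inputs -> 4 hidden units (placeholder weights)
    W2 <- matrix(rnorm(4 * 2), 4, 2)   # 4 hidden units -> 2 outputs (placeholder weights)

    h <- pmax(0, x %*% W1)             # hidden layer with ReLU
    softmax(as.vector(h %*% W2))       # class probabilities that sum to 1
    as.vector(h %*% W2)                # raw linear outputs for regression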
