
  • Neural Networks with R
  • Giuseppe Ciaburro, Balaji Venkateswaran

Which activation functions to use?

Given that neural networks need to support nonlinearity and greater complexity, the activation function we choose must be robust enough to satisfy the following requirements (a short R sketch after the list illustrates them):

  • It should be differentiable; we will see why we need differentiation in backpropagation. It should not cause gradients to vanish.
  • It should be simple and fast to compute.
  • It should be zero-centered.
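
A minimal sketch in base R (no packages assumed) of three common activation functions and their derivatives, so the properties above can be checked numerically: all three are differentiable, but their output ranges and computational costs differ.

# Common activation functions
sigmoid  <- function(x) 1 / (1 + exp(-x))   # output in (0, 1): not zero-centered
tanh_act <- function(x) tanh(x)              # output in (-1, 1): zero-centered
relu     <- function(x) pmax(0, x)           # output in [0, Inf): very cheap to compute

# Their analytic derivatives, which backpropagation needs
sigmoid_grad <- function(x) sigmoid(x) * (1 - sigmoid(x))
tanh_grad    <- function(x) 1 - tanh(x)^2
relu_grad    <- function(x) ifelse(x > 0, 1, 0)

# Compare outputs over a range of inputs
x <- seq(-5, 5, by = 0.5)
round(rbind(sigmoid = sigmoid(x), tanh = tanh_act(x), relu = relu(x)), 3)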

The sigmoid is the most widely used activation function, but it suffers from the following drawbacks:

  • Since it is based on the logistic (exponential) function, its computation is relatively slow and complex
  • It causes gradients to vanish, so at some point no signal passes back through the neurons (illustrated in the sketch after this list)
  • It converges slowly
  • It is not zero-centered
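
A rough base R illustration of the vanishing gradient issue: the sigmoid derivative never exceeds 0.25, so a gradient propagated back through several sigmoid layers shrinks multiplicatively. The numbers below are only indicative of the best case, not of a real trained network.

sigmoid      <- function(x) 1 / (1 + exp(-x))
sigmoid_grad <- function(x) sigmoid(x) * (1 - sigmoid(x))

# The derivative peaks at x = 0 with value 0.25
max(sigmoid_grad(seq(-10, 10, by = 0.01)))

# Gradient signal surviving after n sigmoid layers, assuming the best case of 0.25 per layer
n <- 1:10
data.frame(layers = n, surviving_gradient = 0.25^n)

# For saturated inputs the derivative is essentially zero, so almost no signal passes back
sigmoid_grad(c(-10, 10))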

These drawbacks are addressed by ReLU. ReLU is simple and faster to compute, it does not suffer from the vanishing gradient problem, and it has shown large improvements over the sigmoid and tanh functions. ReLU is the preferred activation function for neural networks and DL problems.

ReLU is used for the hidden layers, while the output layer can use a softmax function for classification problems or a linear function for regression problems.
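
A small, hypothetical forward pass in base R showing the usual pairing: ReLU in the hidden layer and softmax at the output for classification (or the identity/linear function for regression). The weights here are random placeholders, not a trained network, and the layer sizes are arbitrary.

set.seed(1)
relu    <- function(x) pmax(0, x)
softmax <- function(x) exp(x) / sum(exp(x))

x  <- c(0.2, -1.3, 0.8)                  # one input vector with 3 features
W1 <- matrix(rnorm(3 * 4), nrow = 4)     # hidden layer: 4 units
b1 <- rnorm(4)
W2 <- matrix(rnorm(4 * 2), nrow = 2)     # output layer: 2 classes
b2 <- rnorm(2)

h   <- relu(W1 %*% x + b1)               # hidden activations (ReLU)
out <- softmax(W2 %*% h + b2)            # class probabilities (sum to 1)
out

# For a regression network the last step would simply be: out <- W2 %*% h + b2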
