- Neural Networks with R
- Giuseppe Ciaburro, Balaji Venkateswaran
Which activation functions to use?
Given that neural networks must support nonlinearity and greater complexity, the activation function used has to be robust enough to satisfy the following:
- It should be differentiable; we will see why differentiation is needed in backpropagation. It should not cause gradients to vanish (a short sketch of common activation functions and their derivatives follows this list).
- It should be simple and fast to compute.
- It should be zero-centered.
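The sketch below is not from the book; helper names such as dsigmoid and drelu are our own. It defines a few common activation functions and their derivatives in base R, since the derivative of each unit is exactly what backpropagation multiplies together:

```r
# Minimal sketch: common activation functions and their derivatives in base R.
# Backpropagation multiplies these local derivatives along the network.
sigmoid  <- function(x) 1 / (1 + exp(-x))
dsigmoid <- function(x) sigmoid(x) * (1 - sigmoid(x))  # peaks at 0.25 when x = 0

relu  <- function(x) pmax(0, x)
drelu <- function(x) as.numeric(x > 0)                 # 1 for positive inputs, 0 otherwise

dtanh <- function(x) 1 - tanh(x)^2                     # tanh() itself is built into R

x <- c(-5, -1, 0, 1, 5)
rbind(sigmoid = sigmoid(x), d_sigmoid = dsigmoid(x),
      relu = relu(x), d_relu = drelu(x))
```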
The sigmoid is the most widely used activation function, but it suffers from the following setbacks:
- Since it uses the logistic (exponential) model, its computation is time-consuming and complex
- It causes gradients to vanish, so that at some point no signal passes through the neurons (see the sketch after this list)
- It converges slowly
- It is not zero-centered
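As a rough illustration of the vanishing gradient setback (a sketch, not taken from the book): the sigmoid's derivative never exceeds 0.25 and is nearly zero once a neuron saturates, so gradients shrink quickly as they are multiplied through successive sigmoid layers:

```r
sigmoid  <- function(x) 1 / (1 + exp(-x))
dsigmoid <- function(x) sigmoid(x) * (1 - sigmoid(x))

dsigmoid(0)    # 0.25: the largest local gradient a sigmoid unit can ever pass back
dsigmoid(10)   # ~4.5e-05: a saturated unit passes back almost nothing
0.25 ^ 10      # upper bound on the gradient after 10 sigmoid layers (~9.5e-07)
```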
These drawbacks are solved by ReLU. ReLU is simple and faster to compute. It does not suffer from the vanishing gradient problem and has shown vast improvements over the sigmoid and tanh functions. ReLU is the preferred activation function for neural networks and DL problems.
ReLU is used for the hidden layers, while the output layer can use a softmax function for classification problems and a linear function for regression problems.
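As a minimal sketch of this layer pattern (the weights below are random placeholders of our own, not values from the book), a forward pass can apply ReLU in the hidden layer and softmax at the output to turn scores into class probabilities; a linear (identity) output would replace softmax for regression:

```r
relu    <- function(x) pmax(0, x)
softmax <- function(x) { e <- exp(x - max(x)); e / sum(e) }  # numerically stabilized

set.seed(1)
x  <- c(0.5, -1.2, 3.0)               # one input example with 3 features
W1 <- matrix(rnorm(4 * 3), nrow = 4)  # hidden layer: 3 inputs -> 4 units
b1 <- rnorm(4)
W2 <- matrix(rnorm(2 * 4), nrow = 2)  # output layer: 4 units -> 2 classes
b2 <- rnorm(2)

h   <- relu(W1 %*% x + b1)            # hidden activations (ReLU)
out <- softmax(W2 %*% h + b2)         # output probabilities sum to 1
out
```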