
  • Deep Learning Essentials
  • Wei Di, Anurag Bhardwaj, Jianing Wei

Leaky ReLU and maxout

A Leaky ReLU has a small slope α, such as 0.01, on the negative side: f(x) = x for x > 0 and f(x) = αx otherwise. The slope α can also be made a learnable parameter of each neuron, as in PReLU neurons (P stands for parametric). The problem with these activation functions is that the benefit of such modifications is inconsistent across different problems.
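A minimal NumPy sketch of these two variants may help; the function names and the per-neuron alpha array are illustrative choices, not code from the book:

```python
import numpy as np

def leaky_relu(x, alpha=0.01):
    # Positive inputs pass through unchanged; negative inputs are scaled
    # by a small fixed slope alpha (e.g. 0.01) instead of being zeroed.
    return np.where(x > 0, x, alpha * x)

def prelu(x, alpha):
    # Same formula as Leaky ReLU, but alpha is a learned parameter
    # (here one slope per neuron, updated by backpropagation).
    return np.where(x > 0, x, alpha * x)

x = np.array([-2.0, -0.5, 0.0, 1.5])
print(leaky_relu(x))                       # [-0.02  -0.005  0.     1.5  ]
print(prelu(x, alpha=np.full(4, 0.1)))     # [-0.2   -0.05   0.     1.5  ]
```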

Maxout is another attempt to solve the dead neuron problem in ReLU. It takes the form f(x) = max(w₁ᵀx + b₁, w₂ᵀx + b₂). From this form, we can see that both ReLU and Leaky ReLU are just special cases; for example, setting w₁ = 0 and b₁ = 0 recovers ReLU, max(0, w₂ᵀx + b₂). Although maxout benefits from piecewise linearity and never saturating, it doubles the number of parameters for every single neuron.
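A short sketch of a maxout layer under the two-piece form above; the shapes and variable names are illustrative assumptions:

```python
import numpy as np

def maxout(x, W1, b1, W2, b2):
    # Each maxout unit computes two affine transforms of the input and
    # keeps the larger one: max(W1.x + b1, W2.x + b2).
    return np.maximum(x @ W1 + b1, x @ W2 + b2)

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3))          # batch of 4 inputs, 3 features each
W1, W2 = rng.normal(size=(2, 3, 5))  # two weight sets -> 5 maxout units
b1, b2 = rng.normal(size=(2, 5))     # two bias sets, doubling the parameters
print(maxout(x, W1, b1, W2, b2).shape)  # (4, 5)
```

Setting W1 and b1 to zero in this sketch reduces each unit to max(0, W2.x + b2), i.e. an ordinary ReLU on an affine transform.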
