官术网_书友最值得收藏!

Why do we use xavier initialization?

The following factors call for the application of xavier initialization:

  • If the weights in a network start very small, most of the signals will shrink and become dormant at the activation function in the later layers

  • If the weights start very large, most of the signals will massively grow and pass through the activation functions in the later layers

Thus, xavier initialization helps in generating optimal weights, such that the signals are within optimal range, thereby minimizing the chances of the signals getting neither too small nor too large.

The derivation of the preceding formula is beyond the scope of this book. Feel free to search here (http://andyljones.tumblr.com/post/110998971763/an-explanation-of-xavier-initialization) and go through the derivation for a better understanding.

主站蜘蛛池模板: 县级市| 阿城市| 温州市| 张家港市| 绿春县| 板桥市| 达拉特旗| 兰西县| 宁化县| 新蔡县| 涟水县| 梨树县| 三河市| 江永县| 鲁山县| 牡丹江市| 本溪市| 中西区| 建始县| 安庆市| 吉木萨尔县| 德惠市| 伽师县| 六安市| 碌曲县| 莱州市| 邯郸县| 林周县| 安康市| 搜索| 嘉祥县| 绥宁县| 屏边| 东乌珠穆沁旗| 西畴县| 望都县| 长沙县| 井研县| 庄河市| 双辽市| 南丰县|