官术网_书友最值得收藏!

  • Deep Learning with Keras
  • Antonio Gulli Sujit Pal
  • 74字
  • 2021-07-02 23:58:05

Increasing the size of batch computation

Gradient descent tries to minimize the cost function on all the examples provided in the training sets and, at the same time, for all the features provided in the input. Stochastic gradient descent is a much less expensive variant, which considers only BATCH_SIZE examples. So, let's see what the behavior is by changing this parameter. As you can see, the optimal accuracy value is reached for BATCH_SIZE=128:

主站蜘蛛池模板: 新津县| 雷州市| 会昌县| 固镇县| 永嘉县| 棋牌| 白沙| 绵竹市| 宿松县| 任丘市| 青州市| 大洼县| 孝感市| 马边| 望奎县| 贡觉县| 无锡市| 渭源县| 繁昌县| 蕉岭县| 武强县| 上蔡县| 嘉峪关市| 吉水县| 南召县| 平定县| 同心县| 和龙市| 霍林郭勒市| 灵武市| 侯马市| 永春县| 新安县| 桃江县| 翁源县| 永福县| 云浮市| 湘潭市| 南乐县| 长兴县| 克拉玛依市|