官术网_书友最值得收藏!

  • Deep Learning with Keras
  • Antonio Gulli Sujit Pal
  • 74字
  • 2021-07-02 23:58:05

Increasing the size of batch computation

Gradient descent tries to minimize the cost function on all the examples provided in the training sets and, at the same time, for all the features provided in the input. Stochastic gradient descent is a much less expensive variant, which considers only BATCH_SIZE examples. So, let's see what the behavior is by changing this parameter. As you can see, the optimal accuracy value is reached for BATCH_SIZE=128:

主站蜘蛛池模板: 彭州市| 潼关县| 蓝田县| 克拉玛依市| 西乡县| 聊城市| 宣威市| 建阳市| 江陵县| 凤山市| 开化县| 东平县| 宁波市| 乌兰浩特市| 沈丘县| 永泰县| 永昌县| 吉安市| 阳信县| 惠安县| 伊春市| 肥西县| 安阳市| 祁门县| 平谷区| 宁陕县| 海南省| 遂川县| 抚顺市| 县级市| 阳山县| 沅陵县| 昌邑市| 临西县| 竹溪县| 乌鲁木齐县| 城口县| 洪洞县| 双流县| 和田市| 乐业县|