官术网_书友最值得收藏!

Deep Q-learning

Deep Q-learning represents an evolution of the basic Q-learning method the state-action is replaced by a neural network, with the aim of approximating the optimal value function.

Compared to the previous approaches, where it was used to structure the network in order to request both input and action and providing its expected return, Deep Q-learning revolutionizes the structure in order to request only the state of the environment and supply as many status-action values as there are actions that can be performed in the environment.

主站蜘蛛池模板: 吉林省| 许昌市| 四平市| 丰城市| 融水| 晴隆县| 赤壁市| 徐水县| 渝北区| 靖宇县| 宁都县| 宜宾市| 乾安县| 惠安县| 会同县| 巴青县| 阜南县| 江津市| 甘孜县| 枣庄市| 清新县| 临城县| 浙江省| 广丰县| 闽清县| 黄山市| 德安县| 四会市| 淮阳县| 黑水县| 壤塘县| 保靖县| 顺昌县| 黄山市| 宣恩县| 保康县| 长汀县| 左贡县| 湖北省| 远安县| 葫芦岛市|