官术网_书友最值得收藏!

Deep Q-learning

Deep Q-learning represents an evolution of the basic Q-learning method the state-action is replaced by a neural network, with the aim of approximating the optimal value function.

Compared to the previous approaches, where it was used to structure the network in order to request both input and action and providing its expected return, Deep Q-learning revolutionizes the structure in order to request only the state of the environment and supply as many status-action values as there are actions that can be performed in the environment.

主站蜘蛛池模板: 布尔津县| 确山县| 赫章县| 广元市| 平顺县| 萍乡市| 嘉禾县| 织金县| 蓝田县| 长兴县| 淮滨县| 平陆县| 辽阳市| 突泉县| 闽清县| 东台市| 东台市| 额尔古纳市| 信丰县| 容城县| 贺兰县| 彭阳县| 巴楚县| 珠海市| 临泉县| 军事| 滕州市| 凤城市| 水富县| 辉县市| 湖口县| 宿松县| 彝良县| 运城市| 沙坪坝区| 济南市| 大厂| 温州市| 工布江达县| 上饶县| 温泉县|