官术网_书友最值得收藏!

Summary

My congratulations, you've made another step towards understanding modern, state-of-the-art RL methods! We learned about some very important concepts that are widely used in deep RL: the value of state, the value of actions, and the Bellman equation in various forms. We saw the value iteration method, which is a very important building block in the area of Q-learning. Finally, we got to know how value iteration can improve our FrozenLake solution.

In the next chapter, we'll learn about deep Q-networks, which started the deep RL revolution in 2013, by beating humans on lots of Atari 2600 games.

主站蜘蛛池模板: 叶城县| 望江县| 和田市| 虹口区| 山东| 陕西省| 大庆市| 巫溪县| 上犹县| 永春县| 华蓥市| 开封县| 渭源县| 西乌| 叙永县| 河间市| 沙田区| 昭觉县| 武隆县| 凌云县| 泰来县| 会宁县| 平利县| 姜堰市| 佛冈县| 拉孜县| 文水县| 武陟县| 永靖县| 咸阳市| 司法| 江华| 承德县| 平顺县| 突泉县| 铁岭县| 故城县| 安化县| 喀喇沁旗| 博乐市| 石门县|