官术网_书友最值得收藏!

Summary

My congratulations, you've made another step towards understanding modern, state-of-the-art RL methods! We learned about some very important concepts that are widely used in deep RL: the value of state, the value of actions, and the Bellman equation in various forms. We saw the value iteration method, which is a very important building block in the area of Q-learning. Finally, we got to know how value iteration can improve our FrozenLake solution.

In the next chapter, we'll learn about deep Q-networks, which started the deep RL revolution in 2013, by beating humans on lots of Atari 2600 games.

主站蜘蛛池模板: 阿拉善左旗| 吉木萨尔县| 连城县| 康定县| 枣强县| 大连市| 常熟市| 朔州市| 丹东市| 甘肃省| 晋州市| 翁牛特旗| 基隆市| 梁河县| 牙克石市| 成武县| 商都县| 平顺县| 长武县| 轮台县| 灵山县| 阿巴嘎旗| 栾城县| 曲水县| 桂林市| 岳西县| 昭苏县| 龙川县| 关岭| 四川省| 双鸭山市| 台中县| 龙门县| 谢通门县| 枝江市| 绵竹市| 南涧| 阜平县| 新田县| 新宾| 宜君县|