官术网_书友最值得收藏!

Summary

In this chapter, we became familiar with the first RL method cross-entropy, which is simple but quite powerful, despite its limitations. We applied it to a CartPole environment (with huge success) and to FrozenLake (with much more modest success). This chapter ends the introductory part of the book.

In the upcoming chapters, we will explore more complex, but more powerful tools of deep RL.

主站蜘蛛池模板: 深水埗区| 云林县| 德安县| 中阳县| 六安市| 上思县| 旺苍县| 巫溪县| 库车县| 台东市| 株洲县| 永德县| 会宁县| 辽宁省| 神池县| 太谷县| 连江县| 会宁县| 绥芬河市| 藁城市| 松阳县| 剑川县| 镇赉县| 宝清县| 上栗县| 石城县| 海阳市| 吉安市| 胶州市| 雅江县| 将乐县| 涟水县| 香河县| 辰溪县| 古浪县| 弥勒县| 嘉禾县| 双柏县| 嘉义县| 大丰市| 常州市|