Summary

In this chapter, we became familiar with our first RL method, the cross-entropy method, which is simple but quite powerful despite its limitations. We applied it to the CartPole environment (with huge success) and to FrozenLake (with much more modest success). This chapter concludes the introductory part of the book.

In the upcoming chapters, we will explore more complex but more powerful tools of deep RL.
