官术网_书友最值得收藏!

Chapter 5. Tabular Learning and the Bellman Equation

In the previous chapter, we got acquainted with our first Reinforcement Learning (RL) method, cross-entropy, and saw its strengths and weaknesses. In this new part of the book, we'll look at another group of methods, called Q-learning, which have much more flexibility and power.

This chapter will establish the required background shared by those methods. We'll also revisit the FrozenLake environment and show how new concepts will fit with this environment and help us to address the issues of the environment's uncertainty.

主站蜘蛛池模板: 平昌县| 凤凰县| 无锡市| 绥棱县| 永顺县| 隆子县| 呼图壁县| 寿光市| 山东| 张家港市| 响水县| 金沙县| 汽车| 牙克石市| 太湖县| 光泽县| 永昌县| 高陵县| 格尔木市| 藁城市| 秦安县| 丹凤县| 湘潭县| 丽水市| 丰宁| 昌黎县| 余庆县| 安远县| 永安市| 共和县| 惠州市| 寿阳县| 景泰县| 南城县| 拜城县| 分宜县| 改则县| 绥化市| 襄樊市| 叙永县| 长岛县|