官术网_书友最值得收藏!

Chapter 5. Tabular Learning and the Bellman Equation

In the previous chapter, we got acquainted with our first Reinforcement Learning (RL) method, cross-entropy, and saw its strengths and weaknesses. In this new part of the book, we'll look at another group of methods, called Q-learning, which have much more flexibility and power.

This chapter will establish the required background shared by those methods. We'll also revisit the FrozenLake environment and show how new concepts will fit with this environment and help us to address the issues of the environment's uncertainty.

主站蜘蛛池模板: 水富县| 虞城县| 玉龙| 阿尔山市| 辽宁省| 永修县| 乐业县| 吴川市| 保康县| 丹寨县| 天门市| 汝城县| 通化县| 洛南县| 宜州市| 大余县| 噶尔县| 呼和浩特市| 同江市| 惠来县| 同心县| 阳新县| 冕宁县| 怀远县| 保康县| 东莞市| 安岳县| 东源县| 五莲县| 武平县| 龙口市| 黄骅市| 栾城县| 九龙县| 龙江县| 吴旗县| 洮南市| 密山市| 岳普湖县| 大港区| 天祝|