書名： Deep Reinforcement Learning Hands-On
作者名： Maxim Lapan
本章字數： 92字
更新時間： 2021-06-25 20:46:56

Chapter 5. Tabular Learning and the Bellman Equation

In the previous chapter, we got acquainted with our first Reinforcement Learning (RL) method, cross-entropy, and saw its strengths and weaknesses. In this new part of the book, we'll look at another group of methods, called Q-learning, which have much more flexibility and power.

This chapter will establish the required background shared by those methods. We'll also revisit the FrozenLake environment and show how new concepts will fit with this environment and help us to address the issues of the environment's uncertainty.

官术网_书友最值得收藏!

Deep Reinforcement Learning Hands-On

Chapter 5. Tabular Learning and the Bellman Equation