官术网_书友最值得收藏!

Up and Running with Reinforcement Learning

This book will cover interesting topics in deep Reinforcement Learning (RL), including the more widely used algorithms, and will also provide TensorFlow code to solve many challenging problems using deep RL algorithms. Some basic knowledge of RL will help you pick up the advanced topics covered in this book, but the topics will be explained in a simple language that machine learning practitioners can grasp. The language of choice for this book is Python, and the deep learning framework used is TensorFlow, and we expect you to have a reasonable understanding of the two. If not, there are several Packt books that cover these topics. We will cover several different RL algorithms, such as Deep Q-Network (DQN), Deep Deterministic Policy Gradient (DDPG), Trust Region Policy Optimization (TRPO), and Proximal Policy Optimization (PPO), to name a few. Let's dive right into deep RL.

In this chapter, we will delve deep into the basic concepts of RL. We will learn the meaning of the RL jargon, the mathematical relationships between them, and also how to use them in an RL setting to train an agent. These concepts will lay the foundations for us to learn RL algorithms in later chapters, along with how to apply them to train agents. Happy learning! 

Some of the main topics that will be covered in this chapter are as follows:

  • Formulating the RL problem
  • Understanding what an agent and an environment are
  • Defining the Bellman equation
  • On-policy versus off-policy learning
  • Model-free versus model-based training
主站蜘蛛池模板: 望奎县| 弥渡县| 尼勒克县| 吐鲁番市| 盈江县| 铅山县| 射阳县| 喀喇沁旗| 栖霞市| 牙克石市| 同心县| 舟山市| 特克斯县| 邓州市| 宣化县| 荃湾区| 栾川县| 舒兰市| 康马县| 林州市| 石河子市| 柳江县| 同江市| 郁南县| 岑巩县| 曲周县| 合阳县| 松原市| 三门峡市| 志丹县| 库车县| 安吉县| 上饶市| 资溪县| 汕尾市| 绥德县| 永仁县| 筠连县| 桐梓县| 广元市| 灵山县|