- TensorFlow Reinforcement Learning Quick Start Guide
- Kaushik Balakrishnan
- 2021-06-24 15:29:05
Up and Running with Reinforcement Learning
This book covers interesting topics in deep Reinforcement Learning (RL), including the more widely used algorithms, and provides TensorFlow code to solve many challenging problems using deep RL algorithms. Some basic knowledge of RL will help you pick up the advanced topics covered in this book, but the topics are explained in simple language that machine learning practitioners can grasp. The book uses Python as its programming language and TensorFlow as its deep learning framework, and we expect you to have a reasonable understanding of both. If not, there are several Packt books that cover these topics. We will cover several RL algorithms, including Deep Q-Network (DQN), Deep Deterministic Policy Gradient (DDPG), Trust Region Policy Optimization (TRPO), and Proximal Policy Optimization (PPO). Let's dive right into deep RL.
In this chapter, we will delve into the basic concepts of RL. We will learn the meaning of RL terminology, the mathematical relationships between these terms, and how to use them in an RL setting to train an agent. These concepts lay the foundation for learning RL algorithms in later chapters and for applying them to train agents. Happy learning!
The main topics covered in this chapter are as follows:
- Formulating the RL problem
- Understanding what an agent and an environment are
- Defining the Bellman equation
- On-policy versus off-policy learning
- Model-free versus model-based training