書名： TensorFlow Reinforcement Learning Quick Start Guide
作者名： Kaushik Balakrishnan
本章字數： 85字
更新時間： 2021-06-24 15:29:09

Understanding SARSA and Q-Learning

In this section, we will learn about SARSA and Q-Learning and how can they are coded with Python. Before we go further, let's find out what SARSA and Q-Learning are. SARSA is an algorithm that uses the state-action Q values to update. These concepts are derived from the computer science field of dynamic programming, while Q-learning is an off-policy algorithm that was first proposed by Christopher Watkins in 1989, and is a widely used RL algorithm.

官术网_书友最值得收藏!

TensorFlow Reinforcement Learning Quick Start Guide

Understanding SARSA and Q-Learning