- TensorFlow Reinforcement Learning Quick Start Guide
- Kaushik Balakrishnan
- 2021-06-24 15:29:09
Understanding SARSA and Q-Learning
In this section, we will learn about SARSA and Q-Learning and how they can be coded in Python. Before we go further, let's find out what SARSA and Q-Learning are. SARSA is an on-policy algorithm that updates its state-action Q values using the action the agent actually takes in the next state. Q-learning is an off-policy algorithm that was first proposed by Christopher Watkins in 1989, and is a widely used RL algorithm. Both build on concepts derived from the computer science field of dynamic programming.
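To make the difference between the two updates concrete, here is a minimal tabular sketch. It is an illustration under stated assumptions, not the book's own code: the function names `sarsa_update` and `q_learning_update`, the list-of-lists Q table, and the default values of the learning rate `alpha` and discount factor `gamma` are all chosen here for the example.

```python
# Minimal tabular sketch. Names, table layout, and hyperparameters are
# illustrative assumptions, not taken from the book.

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """SARSA: bootstrap from the action actually chosen in the next state."""
    td_target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (td_target - Q[s][a])


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Q-learning: bootstrap from the highest-valued action in the next state."""
    td_target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (td_target - Q[s][a])


def make_table():
    # Two states, two actions; state 1 already has some value estimates.
    return [[0.0, 0.0], [1.0, 2.0]]


# Same transition (s=0, a=0, r=1.0, s_next=1) applied with each rule.
q_sarsa = make_table()
sarsa_update(q_sarsa, s=0, a=0, r=1.0, s_next=1, a_next=0)

q_qlearn = make_table()
q_learning_update(q_qlearn, s=0, a=0, r=1.0, s_next=1)

print(q_sarsa[0][0], q_qlearn[0][0])
```

The two rules differ only in the bootstrap term. SARSA uses the value of the next action the agent's behaviour policy picked (here action 0, worth 1.0), while Q-learning uses the best available value in the next state (2.0) regardless of which action is taken next, so Q-learning ends up with the larger estimate for this transition.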