- TensorFlow Reinforcement Learning Quick Start Guide
- Kaushik Balakrishnan
- 85字
- 2021-06-24 15:29:09
Understanding SARSA and Q-Learning
In this section, we will learn about SARSA and Q-Learning and how can they are coded with Python. Before we go further, let's find out what SARSA and Q-Learning are. SARSA is an algorithm that uses the state-action Q values to update. These concepts are derived from the computer science field of dynamic programming, while Q-learning is an off-policy algorithm that was first proposed by Christopher Watkins in 1989, and is a widely used RL algorithm.
推薦閱讀
- 數據運營之路:掘金數據化時代
- Hands-On Linux for Architects
- 大型數據庫管理系統技術、應用與實例分析:SQL Server 2005
- ESP8266 Home Automation Projects
- 網絡化分布式系統預測控制
- 愛犯錯的智能體
- Hadoop應用開發基礎
- HTML5 Canvas Cookbook
- Hands-On Data Warehousing with Azure Data Factory
- 大數據案例精析
- Artificial Intelligence By Example
- The DevOps 2.1 Toolkit:Docker Swarm
- 貫通開源Web圖形與報表技術全集
- Photoshop CS4數碼照片處理入門、進階與提高
- 新世紀Photoshop CS6中文版應用教程