- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 104字
- 2021-06-24 15:17:22
Policy function
A policy defines the agent's behavior in an environment. The way in which the agent decides which action to perform depends on the policy. Say you want to reach your office from home; there will be different routes to reach your office, and some routes are shortcuts, while some routes are long. These routes are called policies because they represent the way in which we choose to perform an action to reach our goal. A policy is often denoted by the symbol ??. A policy can be in the form of a lookup table or a complex search process.
推薦閱讀
- 數據分析實戰:基于EXCEL和SPSS系列工具的實踐
- 大數據可視化
- 圖解機器學習算法
- Creating Dynamic UIs with Android Fragments(Second Edition)
- INSTANT Cytoscape Complex Network Analysis How-to
- 智能數據分析:入門、實戰與平臺構建
- 數字媒體交互設計(初級):Web產品交互設計方法與案例
- Hands-On Mathematics for Deep Learning
- 白話大數據與機器學習
- 企業級容器云架構開發指南
- 新手學會計(2013-2014實戰升級版)
- 計算機視覺
- 云計算
- 智能與數據重構世界
- 數據迷霧:洞察數據的價值與內涵