- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 104字
- 2021-06-24 15:17:22
Policy function
A policy defines the agent's behavior in an environment. The way in which the agent decides which action to perform depends on the policy. Say you want to reach your office from home; there will be different routes to reach your office, and some routes are shortcuts, while some routes are long. These routes are called policies because they represent the way in which we choose to perform an action to reach our goal. A policy is often denoted by the symbol ??. A policy can be in the form of a lookup table or a complex search process.
推薦閱讀
- Test-Driven Development with Mockito
- Libgdx Cross/platform Game Development Cookbook
- iOS and OS X Network Programming Cookbook
- Spark核心技術與高級應用
- 數據革命:大數據價值實現方法、技術與案例
- PostgreSQL指南:內幕探索
- 數據庫原理與應用
- 中文版Access 2007實例與操作
- Oracle高性能SQL引擎剖析:SQL優化與調優機制詳解
- 活用數據:驅動業務的數據分析實戰
- openGauss數據庫核心技術
- Kubernetes快速進階與實戰
- Machine Learning for Mobile
- 工業大數據分析實踐
- SQL必知必會(第5版)