Value function
A value function measures how good it is for an agent to be in a particular state. It depends on the policy and is often denoted by v(s). Its value at a state s is the total expected reward the agent receives starting from s and following that policy thereafter. Each policy has its own value function, so there can be several; the optimal value function is the one whose value in every state is at least as high as that of any other value function. Similarly, an optimal policy is one whose value function is the optimal value function.
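To make the dependence on the policy concrete, here is a minimal sketch that computes v(s) for two policies on a toy problem. Everything in it is an assumption made for illustration and does not come from the text: the three-state chain, the action names "stay" and "right", the reward of 1 for reaching the terminal state, the discount factor of 0.9, and the helper name evaluate_policy.

```python
GAMMA = 0.9  # assumed discount factor (illustrative choice)

# Hypothetical toy problem: states 0 and 1 are non-terminal, state 2 is terminal.
# TRANSITIONS[state][action] = (next_state, reward); transitions are deterministic.
TRANSITIONS = {
    0: {"stay": (0, 0.0), "right": (1, 0.0)},
    1: {"stay": (1, 0.0), "right": (2, 1.0)},
}
TERMINAL = 2


def evaluate_policy(policy, gamma=GAMMA, tol=1e-10):
    """Iteratively compute v(s) for a deterministic policy.

    Repeats the update v(s) = reward + gamma * v(next_state) until the
    largest change in one sweep over the states falls below `tol`.
    """
    v = {0: 0.0, 1: 0.0, TERMINAL: 0.0}  # the terminal state keeps value 0
    while True:
        delta = 0.0
        for state, action in policy.items():
            next_state, reward = TRANSITIONS[state][action]
            new_value = reward + gamma * v[next_state]
            delta = max(delta, abs(new_value - v[state]))
            v[state] = new_value
        if delta < tol:
            return v


# Two policies over the same states give two different value functions.
always_right = {0: "right", 1: "right"}
stay_then_right = {0: "stay", 1: "right"}

v_right = evaluate_policy(always_right)
v_stay = evaluate_policy(stay_then_right)

print("always_right:   ", v_right)
print("stay_then_right:", v_stay)
```

In this toy problem the value function of always_right is at least as high as that of stay_then_right in every state, which is the sense in which one value function, and therefore one policy, is better than another.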