Questions
The question list is as follows:
- What is the Markov property?
- Why do we need the Markov Decision Process?
- When do we prefer immediate rewards?
- What is the use of the discount factor?
- Why do we use the Bellman function?
- How would you derive the Bellman equation for a Q function?
- How are the value function and Q function related?
- What is the difference between value iteration and policy iteration?
推薦閱讀
- MySQL高可用解決方案:從主從復制到InnoDB Cluster架構
- Modern Programming: Object Oriented Programming and Best Practices
- 計算機信息技術基礎實驗與習題
- 正則表達式必知必會
- 大數據導論
- Learning JavaScriptMVC
- Lean Mobile App Development
- Hadoop 3.x大數據開發實戰
- 數據庫技術實用教程
- 計算機應用基礎教程上機指導與習題集(微課版)
- Hadoop大數據開發案例教程與項目實戰(在線實驗+在線自測)
- IPython Interactive Computing and Visualization Cookbook(Second Edition)
- 新手學會計(2013-2014實戰升級版)
- 大數據分析:數據倉庫項目實戰
- Hands-On System Programming with C++