Questions
The question list is as follows:
- What is the Markov property?
- Why do we need the Markov Decision Process?
- When do we prefer immediate rewards?
- What is the use of the discount factor?
- Why do we use the Bellman equation?
- How would you derive the Bellman equation for a Q function?
- How are the value function and Q function related?
- What is the difference between value iteration and policy iteration?