Cliff walking example of on-policy and off-policy of TD control
- Statistics for Machine Learning
- Pratap Dangeti
- 948字
- 2021-07-02 19:06:31
上QQ閱讀APP看后續精彩內容
登錄訂閱本章 >
推薦閱讀
- BeagleBone Media Center
- Java開發入行真功夫
- Visual C
- Monitoring Elasticsearch
- Mastering Apache Spark 2.x(Second Edition)
- 零基礎學Python網絡爬蟲案例實戰全流程詳解(高級進階篇)
- Android開發:從0到1 (清華開發者書庫)
- C#開發案例精粹
- Mastering React
- 單片機原理及應用技術
- INSTANT JQuery Flot Visual Data Analysis
- Learning Concurrency in Python
- PHP項目開發全程實錄(第4版)
- Java多線程并發體系實戰(微課視頻版)
- C++ Data Structures and Algorithm Design Principles