Lunar Lander using policy gradients
- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 722字
- 2021-06-24 15:17:57
上QQ閱讀APP看后續(xù)精彩內(nèi)容
登錄訂閱本章 >
推薦閱讀
- 數(shù)據(jù)庫(kù)原理及應(yīng)用教程(第4版)(微課版)
- 從零開(kāi)始學(xué)Hadoop大數(shù)據(jù)分析(視頻教學(xué)版)
- SQL Server 2012數(shù)據(jù)庫(kù)技術(shù)與應(yīng)用(微課版)
- 正則表達(dá)式必知必會(huì)
- 從0到1:JavaScript 快速上手
- MATLAB Graphics and Data Visualization Cookbook
- Apache Kylin權(quán)威指南
- 企業(yè)級(jí)容器云架構(gòu)開(kāi)發(fā)指南
- IPython Interactive Computing and Visualization Cookbook(Second Edition)
- HikariCP連接池實(shí)戰(zhàn)
- 大數(shù)據(jù)分析:數(shù)據(jù)倉(cāng)庫(kù)項(xiàng)目實(shí)戰(zhàn)
- Gideros Mobile Game Development
- 數(shù)據(jù)中心經(jīng)營(yíng)之道
- AndEngine for Android Game Development Cookbook
- 區(qū)塊鏈應(yīng)用開(kāi)發(fā)指南:業(yè)務(wù)場(chǎng)景剖析與實(shí)戰(zhàn)