The epsilon-greedy policy
- Python Reinforcement Learning
- Sudharsan Ravichandiran, Sean Saito, Rajalingappaa Shanmugamani, Yang Wenzhuo
- 202 characters
- 2021-06-24 15:17:42