Solving multi-armed bandit problems with the softmax exploration
- PyTorch 1.x Reinforcement Learning Cookbook
- Yuxi (Hayden) Liu
- 157字
- 2021-06-24 12:35:02
上QQ閱讀APP看后續(xù)精彩內(nèi)容
登錄訂閱本章 >
推薦閱讀
- Splunk 7 Essentials(Third Edition)
- 大數(shù)據(jù)技術(shù)基礎(chǔ)
- SCRATCH與機(jī)器人
- 網(wǎng)頁編程技術(shù)
- 機(jī)艙監(jiān)測與主機(jī)遙控
- 工業(yè)機(jī)器人入門實用教程(KUKA機(jī)器人)
- 大數(shù)據(jù)技術(shù)與應(yīng)用
- Moodle Course Design Best Practices
- 生物3D打?。簭尼t(yī)療輔具制造到細(xì)胞打印
- Mastering Text Mining with R
- Mastering Ansible(Second Edition)
- JRuby語言實戰(zhàn)技術(shù)
- Learn Microsoft Azure
- 計算機(jī)組裝與維修實訓(xùn)
- DynamoDB Applied Design Patterns