Solving multi-armed bandit problems with the softmax exploration
- PyTorch 1.x Reinforcement Learning Cookbook
- Yuxi (Hayden) Liu
- 157字
- 2021-06-24 12:35:02
上QQ閱讀APP看后續(xù)精彩內(nèi)容
登錄訂閱本章 >
推薦閱讀
- Internet接入·網(wǎng)絡(luò)安全
- 數(shù)據(jù)中心建設(shè)與管理指南
- 大數(shù)據(jù)時(shí)代的數(shù)據(jù)挖掘
- ROS機(jī)器人編程與SLAM算法解析指南
- CorelDRAW X4中文版平面設(shè)計(jì)50例
- 水晶石精粹:3ds max & ZBrush三維數(shù)字靜幀藝術(shù)
- Docker High Performance(Second Edition)
- 數(shù)據(jù)通信與計(jì)算機(jī)網(wǎng)絡(luò)
- Hybrid Cloud for Architects
- 構(gòu)建高性能Web站點(diǎn)
- JavaScript典型應(yīng)用與最佳實(shí)踐
- Deep Reinforcement Learning Hands-On
- Chef:Powerful Infrastructure Automation
- C++程序設(shè)計(jì)基礎(chǔ)(上)
- 精通LabVIEW程序設(shè)計(jì)