- TensorFlow Reinforcement Learning Quick Start Guide
- Kaushik Balakrishnan
- 35字
- 2021-06-24 15:29:07
On-policy versus off-policy learning
RL algorithms can be classified as on-policy or off-policy. We will now learn about both of these classes and how to distinguish a given RL algorithm into one or the other.
推薦閱讀
- AutoCAD繪圖實(shí)用速查通典
- 腦動(dòng)力:Linux指令速查效率手冊(cè)
- 大數(shù)據(jù)戰(zhàn)爭(zhēng):人工智能時(shí)代不能不說(shuō)的事
- JavaScript實(shí)例自學(xué)手冊(cè)
- 腦動(dòng)力:PHP函數(shù)速查效率手冊(cè)
- HBase Design Patterns
- ROS機(jī)器人編程與SLAM算法解析指南
- 人工智能工程化:應(yīng)用落地與中臺(tái)構(gòu)建
- 傳感器與物聯(lián)網(wǎng)技術(shù)
- 計(jì)算機(jī)網(wǎng)絡(luò)安全
- 數(shù)據(jù)庫(kù)系統(tǒng)原理及應(yīng)用教程(第5版)
- 愛(ài)犯錯(cuò)的智能體
- Visual FoxPro數(shù)據(jù)庫(kù)基礎(chǔ)及應(yīng)用
- 零起點(diǎn)學(xué)西門子S7-200 PLC
- Visual C++項(xiàng)目開(kāi)發(fā)案例精粹