書名： TensorFlow Reinforcement Learning Quick Start Guide
作者名： Kaushik Balakrishnan
本章字?jǐn)?shù)： 35字
更新時(shí)間： 2021-06-24 15:29:07

On-policy versus off-policy learning

RL algorithms can be classified as on-policy or off-policy. We will now learn about both of these classes and how to distinguish a given RL algorithm into one or the other.

官术网_书友最值得收藏!

TensorFlow Reinforcement Learning Quick Start Guide

On-policy versus off-policy learning