官术网_书友最值得收藏!

  • Python Reinforcement Learning
  • Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
  • 123字
  • 2021-06-24 15:17:31

Episodic and continuous tasks

Episodic tasks are the tasks that have a terminal state (end). In RL, episodes are considered agent-environment interactions from initial to final states.

For example, in a car racing video game, you start the game (initial state) and play the game until it is over (final state). This is called an episode. Once the game is over, you start the next episode by restarting the game, and you will begin from the initial state irrespective of the position you were in the previous game. So, each episode is independent of the other.

In a continuous task, there is not a terminal state. Continuous tasks will never end. For example, a personal assistance robot does not have a terminal state.

主站蜘蛛池模板: 虎林市| 诸暨市| 嵩明县| 怀仁县| 长寿区| 合阳县| 高台县| 砚山县| 封开县| 来安县| 石城县| 建宁县| 威海市| 青川县| 昭觉县| 永顺县| 洛阳市| 北碚区| 方城县| 张家港市| 陇西县| 潜山县| 潞城市| 屏山县| 泰兴市| 台南县| 上思县| 循化| 甘德县| 分宜县| 凌云县| 长宁县| 荣成市| 德保县| 伊宁市| 长治县| 清苑县| 温宿县| 格尔木市| 武清区| 江西省|