官术网_书友最值得收藏!

  • Python Reinforcement Learning
  • Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
  • 123字
  • 2021-06-24 15:17:31

Episodic and continuous tasks

Episodic tasks are the tasks that have a terminal state (end). In RL, episodes are considered agent-environment interactions from initial to final states.

For example, in a car racing video game, you start the game (initial state) and play the game until it is over (final state). This is called an episode. Once the game is over, you start the next episode by restarting the game, and you will begin from the initial state irrespective of the position you were in the previous game. So, each episode is independent of the other.

In a continuous task, there is not a terminal state. Continuous tasks will never end. For example, a personal assistance robot does not have a terminal state.

主站蜘蛛池模板: 巴马| 天台县| 嘉禾县| 苗栗市| 镇原县| 霞浦县| 二连浩特市| 海阳市| 灯塔市| 仁化县| 麻江县| 翼城县| 辉县市| 伽师县| 天镇县| 苗栗县| 乌拉特前旗| 延津县| 高邑县| 左云县| 巴青县| 安阳县| 东明县| 景谷| 汕尾市| 滨海县| 嘉峪关市| 孟津县| 高雄县| 全椒县| 祁东县| 无棣县| 孟连| 苗栗县| 绍兴县| 广汉市| 邹平县| 浦县| 唐海县| 长海县| 固安县|