官术网_书友最值得收藏!

Agent environment interface

Agents are the software agents that perform actions, At, at a time, t, to move from one state, St, to another state St+1. Based on actions, agents receive a numerical reward, R, from the environment. Ultimately, RL is all about finding the optimal actions that will increase the numerical reward:

Let us understand the concept of RL with a maze game:

The objective of a maze is to reach the destination without getting stuck on the obstacles. Here's the workflow:

  • The agent is the one who travels through the maze, which is our software program/ RL algorithm
  • The environment is the maze
  • The state is the position in a maze that the agent currently resides in 
  • An agent performs an action by moving from one state to another
  • An agent receives a positive reward when its action doesn't get stuck on any obstacle and receives a negative reward when its action gets stuck on obstacles so it cannot reach the destination
  • The goal is to clear the maze and reach the destination
主站蜘蛛池模板: 蛟河市| 大港区| 襄樊市| 古田县| 栾城县| 海南省| 沁水县| 荣成市| 永靖县| 伽师县| 长丰县| 丘北县| 南开区| 兖州市| 延安市| 九台市| 谷城县| 安远县| 德保县| 内乡县| 松原市| 宝清县| 安徽省| 赣榆县| 吉隆县| 博白县| 宁都县| 舟山市| 额济纳旗| 沙河市| 年辖:市辖区| 毕节市| 布拖县| 衡山县| 文水县| 思南县| 大宁县| 临澧县| 大兴区| 双鸭山市| 同心县|