
Agent environment interface

Agents are software programs that perform an action, A_t, at time t to move from one state, S_t, to the next state, S_{t+1}. For each action, the agent receives a numerical reward, R, from the environment. Ultimately, RL is about finding the optimal actions, that is, the actions that maximize the numerical reward the agent collects.
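The interaction described above can be sketched as a simple loop. This is only an illustration under stated assumptions: CounterEnv, its reward values, and run_episode are hypothetical names made up for this example, not part of any library.

```python
class CounterEnv:
    """Hypothetical toy environment: the state is an integer counter."""

    def __init__(self):
        self.state = 0  # initial state S_0

    def step(self, action):
        # The action A_t is +1 or -1; the next state is S_{t+1} = S_t + A_t.
        self.state += action
        # Assumed reward rule for illustration: +1 for moving up, -1 for moving down.
        reward = 1.0 if action == 1 else -1.0
        return self.state, reward


def run_episode(env, policy, steps):
    """Run the agent-environment loop for a fixed number of time steps."""
    state = env.state
    total_reward = 0.0
    trajectory = []
    for t in range(steps):
        action = policy(state)                 # agent picks A_t from S_t
        next_state, reward = env.step(action)  # environment returns S_{t+1} and R
        trajectory.append((t, state, action, reward, next_state))
        total_reward += reward
        state = next_state
    return total_reward, trajectory


# A fixed policy that always chooses +1 collects the highest reward here.
total, trajectory = run_episode(CounterEnv(), lambda state: 1, steps=5)
```

In this toy setting the best policy is obvious. In a real RL problem the agent has to discover it from the rewards it receives.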

Let us understand the concept of RL with a maze game.

The objective of a maze is to reach the destination without getting stuck on the obstacles. Here's the workflow:

  • The agent is our software program (the RL algorithm), which travels through the maze
  • The environment is the maze
  • The state is the agent's current position in the maze
  • An agent performs an action by moving from one state to another
  • The agent receives a positive reward when its action does not run into an obstacle, and a negative reward when its action gets it stuck on an obstacle so that it cannot reach the destination
  • The goal is to clear the maze and reach the destination
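The maze workflow above can be sketched in a few lines of Python. This is a minimal sketch, not a definitive implementation: the 3x3 layout, the reward values of +1 and -1, the choice to treat the maze boundary as an obstacle, and the names MAZE, MOVES, step, and run are all assumptions made for illustration.

```python
# Hypothetical maze: S = start, G = destination, # = obstacle, . = free cell.
MAZE = [
    "S.#",
    ".##",
    "..G",
]

# Each action moves the agent by one cell: (row change, column change).
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def step(state, action):
    """Apply one action; return (next_state, reward, done)."""
    row, col = state
    d_row, d_col = MOVES[action]
    new_row, new_col = row + d_row, col + d_col
    inside = 0 <= new_row < len(MAZE) and 0 <= new_col < len(MAZE[0])
    if not inside or MAZE[new_row][new_col] == "#":
        # Blocked by an obstacle (or the boundary): stay put, negative reward.
        return state, -1.0, False
    done = MAZE[new_row][new_col] == "G"
    # The move succeeded: positive reward.
    return (new_row, new_col), 1.0, done


def run(actions, start=(0, 0)):
    """Follow a list of actions from the start; stop at the destination."""
    state, total_reward, done = start, 0.0, False
    for action in actions:
        state, reward, done = step(state, action)
        total_reward += reward
        if done:
            break
    return state, total_reward, done


# One action sequence that clears this particular maze.
final_state, total_reward, reached = run(["down", "down", "right", "right"])
```

Here the action sequence is written by hand. An RL algorithm would instead learn which action to take in each state from the rewards returned by step.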