
Programming an agent using an OpenAI Gym environment

This section uses the FrozenLake-v0 environment. Its documentation can be found at https://gym.openai.com/envs/FrozenLake-v0/.

This environment consists of a 4 x 4 grid representing a lake. The grid has 16 blocks, and each block is a start block (S), a frozen block (F), a goal block (G), or a hole block (H). The objective of the agent is to learn to navigate from the start to the goal without falling into a hole:

import gym  # the package name is lowercase
env = gym.make('FrozenLake-v0')  # loads the environment FrozenLake-v0
env.reset()  # places the agent on the start block
env.render()  # outputs the environment and the position of the agent

-------------------
SFFF
FHFH
FFFH
HFFG
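
To make the layout concrete, the rendered map can be treated as a list of strings. The following is a minimal sketch, not part of the Gym API: the names LAKE and tile_at are hypothetical, and it assumes the 16 states are numbered row by row from 0 (top left) to 15 (bottom right).

```python
# Hypothetical stand-in for the rendered map; not part of the Gym API.
LAKE = ["SFFF", "FHFH", "FFFH", "HFFG"]


def tile_at(state, lake=LAKE):
    """Return the block letter for a state index numbered row by row from 0."""
    n_cols = len(lake[0])
    row, col = divmod(state, n_cols)
    return lake[row][col]


# Count how many blocks of each type the lake contains.
counts = {}
for state in range(16):
    tile = tile_at(state)
    counts[tile] = counts.get(tile, 0) + 1

print(counts)  # {'S': 1, 'F': 10, 'H': 4, 'G': 1}
```

Under this numbering, state 0 is the start block and state 15 is the goal block.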

In any given state, the agent can take one of four actions: up, down, left, or right. The reward at each step is 0, except for the step that leads to the goal state, which gives a reward of 1. We start from the S state, and our goal is to reach the G state by the shortest path through the F states without landing on an H state.
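
The action and reward rules above can be sketched in plain Python without Gym. This is an illustrative model under stated assumptions, not Gym's implementation: the step and run helpers are hypothetical, movement is deterministic (the real FrozenLake-v0 is slippery by default, so the agent does not always move in the chosen direction), and the action codes follow Gym's encoding of 0 = left, 1 = down, 2 = right, 3 = up.

```python
# Hypothetical deterministic model of the lake; not Gym's implementation.
LAKE = ["SFFF", "FHFH", "FFFH", "HFFG"]
LEFT, DOWN, RIGHT, UP = 0, 1, 2, 3
MOVES = {LEFT: (0, -1), DOWN: (1, 0), RIGHT: (0, 1), UP: (-1, 0)}


def step(state, action, lake=LAKE):
    """Apply one action and return (next_state, reward, done)."""
    n_rows, n_cols = len(lake), len(lake[0])
    row, col = divmod(state, n_cols)
    d_row, d_col = MOVES[action]
    # Moves that would leave the grid keep the agent at the edge.
    row = min(max(row + d_row, 0), n_rows - 1)
    col = min(max(col + d_col, 0), n_cols - 1)
    tile = lake[row][col]
    reward = 1.0 if tile == "G" else 0.0  # only reaching the goal pays 1
    done = tile in ("G", "H")  # the episode ends at the goal or in a hole
    return row * n_cols + col, reward, done


def run(actions, start=0):
    """Follow a list of actions from the start state until the episode ends."""
    state, total, done = start, 0.0, False
    for action in actions:
        state, reward, done = step(state, action)
        total += reward
        if done:
            break
    return state, total, done


safe_path = [DOWN, DOWN, RIGHT, DOWN, RIGHT, RIGHT]
print(run(safe_path))      # (15, 1.0, True): reaches G in six steps
print(run([RIGHT, DOWN]))  # (5, 0.0, True): falls into the hole at state 5
```

With this map, six steps is the fewest that can take the agent from S to G, since the goal is three rows down and three columns right of the start.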
