
Programming an agent using an OpenAI Gym environment

The environment used in this section is FrozenLake-v0. Its documentation can be found at https://gym.openai.com/envs/FrozenLake-v0/.

This environment consists of a 4 x 4 grid representing a lake. It therefore has 16 blocks, each of which is a start block (S), a frozen block (F), a goal block (G), or a hole block (H). The objective of the agent is to learn to navigate from the start to the goal without falling into a hole:

import gym
env = gym.make('FrozenLake-v0')  # loads the FrozenLake-v0 environment
env.reset()  # places the agent on the start block
env.render()  # prints the grid and the current position of the agent

-------------------
SFFF
FHFH
FFFH
HFFG

In any state, the agent can choose one of four actions: up, down, left, or right. The reward at each step is 0, except for the step that leads to the goal state, for which the reward is 1. We start from the S state, and our goal is to reach the G state by the most efficient path through the F states without ending up in an H state.
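The reward rule above can be illustrated with a minimal pure-Python sketch that does not depend on Gym. It rests on several assumptions: the names LAKE_MAP, step, and run_episode are hypothetical helpers, moves are treated as deterministic (the real FrozenLake-v0 environment is slippery by default, so an action does not always move the agent in the chosen direction), and states are numbered row by row from 0 to 15. Only the action numbering (0 = left, 1 = down, 2 = right, 3 = up) is taken from Gym.

```python
# Hypothetical, deterministic stand-in for the 4 x 4 Frozen Lake grid.
# Assumption: no slipping, unlike the default Gym environment.
LAKE_MAP = ["SFFF", "FHFH", "FFFH", "HFFG"]

# Action numbering follows Gym's FrozenLake convention.
LEFT, DOWN, RIGHT, UP = 0, 1, 2, 3
MOVES = {LEFT: (0, -1), DOWN: (1, 0), RIGHT: (0, 1), UP: (-1, 0)}


def step(state, action):
    """Return (next_state, reward, done) for one deterministic move."""
    size = len(LAKE_MAP)
    row, col = divmod(state, size)  # states are numbered row by row
    d_row, d_col = MOVES[action]
    # A move that would leave the grid keeps the agent on the edge.
    row = min(max(row + d_row, 0), size - 1)
    col = min(max(col + d_col, 0), size - 1)
    tile = LAKE_MAP[row][col]
    reward = 1.0 if tile == "G" else 0.0  # only reaching the goal pays 1
    done = tile in "GH"  # the episode ends at the goal or in a hole
    return row * size + col, reward, done


def run_episode(actions, start=0):
    """Apply a fixed action sequence and return (state, total_reward, done)."""
    state, total, done = start, 0.0, False
    for action in actions:
        state, reward, done = step(state, action)
        total += reward
        if done:
            break
    return state, total, done


if __name__ == "__main__":
    safe_path = [DOWN, DOWN, RIGHT, RIGHT, DOWN, RIGHT]
    print(run_episode(safe_path))      # walks through F blocks to G
    print(run_episode([RIGHT, DOWN]))  # falls into the hole below the second block
```

Under these assumptions, the first sequence reaches the goal with a total reward of 1, while the second ends in a hole with a total reward of 0. With the real environment, the same information would come from the values returned by env.step(action).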
