官术网_书友最值得收藏!

Cliff walking and grid world problems

Let's consider cliff walking and grid world problems. First, we will introduce these problems to you, then we will proceed on to the coding part. For both problems, we consider a rectangular grid with nrows (number of rows) and ncols (number of columns). We start from one cell to the south of the bottom left cell, and the goal is to reach the destination, which is one cell to the south of the bottom right cell.

Note that the start and destination cells are not part of the nrows x ncols grid of cells. For the cliff walking problem, the cells to the south of the bottom row of cells, except for the start and destination cells, form a cliff where, if the agent enters, the episode ends with catastrophic fall into the cliff. Likewise, if the agent tries to leave the left, top, or right boundaries of the grid of cells, it is placed back in the same cell, that is, it is equivalent to taking no action.

For the grid world problem, we do not have a cliff, but we have obstacles inside the grid world. If the agent tries to enter any of these obstacle cells, it is bounced back to the same cell from which it came. In both these problems, the goal is to find the optimum path from the start to the destination.

So, let's dive on in!

主站蜘蛛池模板: 岑巩县| 冀州市| 乌鲁木齐市| 鄂托克前旗| 元江| 韩城市| 甘肃省| 新蔡县| 武宣县| 尉氏县| 五河县| 方山县| 胶州市| 阿拉尔市| 锦屏县| 广平县| 璧山县| 嘉定区| 台湾省| 洮南市| 东平县| 墨竹工卡县| 乡宁县| 祁门县| 会同县| 通城县| 资兴市| 西乌珠穆沁旗| 竹溪县| 丽水市| 高唐县| 扎赉特旗| 长阳| 渭源县| 北碚区| 登封市| 彩票| 隆德县| 宽城| 遵义县| 云安县|