How it works...

The random search algorithm works so well mainly because of the simplicity of our CartPole environment. Its observation state is composed of only four variables. You will recall that an observation in the Atari Space Invaders game has more than 100,000 values (210 * 160 * 3 = 100,800). The number of possible actions in CartPole is also only a third of that in Space Invaders (two versus six). In general, simple algorithms work well for simple problems. In our case, we simply search a random pool for the best linear mapping from the observation to the action.
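To make the size difference concrete, here is a minimal sketch in plain Python of what one candidate linear mapping looks like and how many weights it needs in each environment. The helper names (`n_weights`, `random_weight`, `choose_action`), the sample state, and the choice of drawing weights uniformly from [0, 1) are assumptions made for illustration, not code from the recipe.

```python
import random

# Sizes taken from the text: CartPole has 4 observation variables and 2 actions;
# a Space Invaders frame has 210 * 160 * 3 values and the game has 6 actions.
CARTPOLE_OBS, CARTPOLE_ACTIONS = 4, 2
INVADERS_OBS, INVADERS_ACTIONS = 210 * 160 * 3, 6


def n_weights(n_obs, n_actions):
    """Number of entries in a linear map from observations to action scores."""
    return n_obs * n_actions


def random_weight(n_obs, n_actions, rng):
    """Draw one candidate mapping; uniform [0, 1) entries are an assumed range."""
    return [[rng.random() for _ in range(n_actions)] for _ in range(n_obs)]


def choose_action(state, weight):
    """Score each action as a weighted sum of the state and pick the highest."""
    n_actions = len(weight[0])
    scores = [sum(s * row[a] for s, row in zip(state, weight))
              for a in range(n_actions)]
    return max(range(n_actions), key=lambda a: scores[a])


rng = random.Random(0)
weight = random_weight(CARTPOLE_OBS, CARTPOLE_ACTIONS, rng)
state = [0.01, -0.02, 0.03, 0.04]  # made-up observation, for illustration only
action = choose_action(state, weight)

print(n_weights(CARTPOLE_OBS, CARTPOLE_ACTIONS))   # 8
print(n_weights(INVADERS_OBS, INVADERS_ACTIONS))   # 604800
```

With only eight weights to get right, a modest pool of random candidates has a realistic chance of containing a good mapping for CartPole. The same approach applied to raw Space Invaders frames would have to search over hundreds of thousands of weights.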

Another interesting thing we've noticed is that, even before we select and deploy the best policy (the best linear mapping), random search outperforms random action. This is because a random linear mapping does take the observation into consideration. With more information from the environment, the decisions made by a random search policy are more intelligent than completely random ones.
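This comparison can be sketched without any external library. The code below is only an illustration under stated assumptions: `step` is a hand-written, simplified cart-pole simulation (not the Gym environment used in the recipe), its physical constants are assumed values typical of the classic cart-pole task, weights are drawn uniformly from [0, 1), and all function names are hypothetical.

```python
import math
import random


def step(state, action):
    """One Euler step of simplified cart-pole physics (stand-in, not Gym)."""
    x, x_dot, theta, theta_dot = state
    force = 10.0 if action == 1 else -10.0      # action 1 pushes the cart right
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    total_mass, pole_mass, half_len, tau = 1.1, 0.1, 0.5, 0.02  # assumed constants
    temp = (force + pole_mass * half_len * theta_dot ** 2 * sin_t) / total_mass
    theta_acc = (9.8 * sin_t - cos_t * temp) / (
        half_len * (4.0 / 3.0 - pole_mass * cos_t ** 2 / total_mass))
    x_acc = temp - pole_mass * half_len * theta_acc * cos_t / total_mass
    return [x + tau * x_dot, x_dot + tau * x_acc,
            theta + tau * theta_dot, theta_dot + tau * theta_acc]


def failed(state):
    """The episode ends when the cart leaves the track or the pole tilts too far."""
    return abs(state[0]) > 2.4 or abs(state[2]) > math.radians(12)


def run_episode(policy, rng, max_steps=200):
    """Return the total reward: one point per step until failure or the step cap."""
    state = [rng.uniform(-0.05, 0.05) for _ in range(4)]
    total = 0
    for _ in range(max_steps):
        state = step(state, policy(state))
        total += 1
        if failed(state):
            break
    return total


def random_action_policy(rng):
    """Ignore the observation and pick either action with equal probability."""
    return lambda state: rng.randrange(2)


def random_linear_policy(rng):
    """Draw a fresh 4x2 weight matrix; the action with the larger score wins."""
    w = [[rng.random(), rng.random()] for _ in range(4)]

    def policy(state):
        scores = [sum(s * row[a] for s, row in zip(state, w)) for a in range(2)]
        return 0 if scores[0] >= scores[1] else 1
    return policy


def average_reward(make_policy, n_episodes, seed):
    """Average total reward, using a newly created policy for every episode."""
    rng = random.Random(seed)
    return sum(run_episode(make_policy(rng), rng)
               for _ in range(n_episodes)) / n_episodes


avg_random = average_reward(random_action_policy, 500, seed=0)
avg_linear = average_reward(random_linear_policy, 500, seed=0)
print(f"random actions:        {avg_random:.1f}")
print(f"random linear mapping: {avg_linear:.1f}")
```

Each episode of the second run uses a different random mapping, so no selection of the best policy has taken place. Any gap between the two averages comes only from the fact that the linear mapping reacts to the observation.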