官术网_书友最值得收藏!

How it works...

The random search algorithm works so well mainly because of the simplicity of our CartPole environment. Its observation state is composed of only four variables. You will recall that the observation in the Atari Space Invaders game is more than 100,000 (which is 210 * 160 * 3)  . The number of dimensions of the action state in CartPole is a third of that in Space Invaders. In general, simple algorithms work well for simple problems. In our case, we simply search for the best linear mapping from the observation to the action from a random pool.

Another interesting thing we've noticed is that before we select and deploy the best policy (the best linear mapping), random search also outperforms random action. This is because random linear mapping does take the observations into consideration. With more information from the environment, the decisions made in the random search policy are more intelligent than completely random ones.

主站蜘蛛池模板: 内黄县| 仲巴县| 乡宁县| 沙雅县| 读书| 青冈县| 余江县| 桑植县| 菏泽市| 逊克县| 准格尔旗| 安吉县| 大同市| 图片| 象州县| 柘荣县| 浦江县| 突泉县| 澎湖县| 冕宁县| 宜春市| 敦煌市| 天门市| 隆化县| 聂拉木县| 奉新县| 环江| 九江市| 韶山市| 西华县| 天气| 平舆县| 万州区| 延边| 运城市| 怀化市| 永昌县| 河北区| 天津市| 宣城市| 荆州市|