- PyTorch 1.x Reinforcement Learning Cookbook
- Yuxi (Hayden) Liu
- 160字
- 2021-06-24 12:34:41
How it works...
The random search algorithm works so well mainly because of the simplicity of our CartPole environment. Its observation state is composed of only four variables. You will recall that the observation in the Atari Space Invaders game is more than 100,000 (which is 210 * 160 * 3) . The number of dimensions of the action state in CartPole is a third of that in Space Invaders. In general, simple algorithms work well for simple problems. In our case, we simply search for the best linear mapping from the observation to the action from a random pool.
Another interesting thing we've noticed is that before we select and deploy the best policy (the best linear mapping), random search also outperforms random action. This is because random linear mapping does take the observations into consideration. With more information from the environment, the decisions made in the random search policy are more intelligent than completely random ones.
- 繪制進程圖:可視化D++語言(第1冊)
- Splunk 7 Essentials(Third Edition)
- PIC單片機C語言非常入門與視頻演練
- 群體智能與數據挖掘
- 空間傳感器網絡復雜區域智能監測技術
- Security Automation with Ansible 2
- Visual C# 2008開發技術詳解
- 工業機器人入門實用教程(KUKA機器人)
- Spark大數據技術與應用
- Apache Superset Quick Start Guide
- 完全掌握AutoCAD 2008中文版:機械篇
- 我也能做CTO之程序員職業規劃
- 大數據驅動的機械裝備智能運維理論及應用
- Linux Shell編程從初學到精通
- 青少年VEX IQ機器人實訓課程(初級)