官术网_书友最值得收藏!

  • The Reinforcement Learning Workshop
  • Alessandro Palmas Emanuele Ghelfi Dr. Alexandra Galina Petre Mayur Kulkarni Anand N.S. Quan Nguyen Aritra Sen Anthony So Saikat Basak
  • 186字
  • 2021-06-11 18:37:49

Summary

This chapter introduced us to the key technologies and concepts we can use to get started with reinforcement learning. The first two sections described two OpenAI Tools, OpenAI Gym and OpenAI Universe. These are collections that contain a large number of control problems that cover a broad spectrum of contexts, from classic tasks to video games, from browser usage to algorithm deduction. We learned how the interfaces of these environments are formalized, how to interact with them, and how to create a custom environment for a specific problem. Then, we learned how to build a policy network with TensorFlow, how to feed it with environment states to retrieve corresponding actions, and how to save the policy network weights. We also studied another OpenAI resource, Baselines. We solved problems that demonstrated how to train a reinforcement learning agent to solve a classic control task. Finally, using all the elements introduced in this chapter, we built an agent and trained it to play a classic Atari video game, thus achieving better-than-human performance.

In the next chapter, we will be delving deep into dynamic programming for reinforcement learning.

主站蜘蛛池模板: 兴安县| 罗源县| 个旧市| 元江| 汉源县| 南陵县| 酒泉市| 大冶市| 安泽县| 永城市| 黔西| 崇义县| 酒泉市| 新乡市| 皮山县| 黄冈市| 宜川县| 郓城县| 维西| 建瓯市| 郧西县| 贵定县| 泾川县| 乐亭县| 松原市| 武夷山市| 扎囊县| 正宁县| 三江| 化德县| 白城市| 白城市| 林西县| 白城市| 饶阳县| 清远市| 纳雍县| 紫阳县| 纳雍县| 西平县| 都江堰市|