官术网_书友最值得收藏!

  • The Reinforcement Learning Workshop
  • Alessandro Palmas Emanuele Ghelfi Dr. Alexandra Galina Petre Mayur Kulkarni Anand N.S. Quan Nguyen Aritra Sen Anthony So Saikat Basak
  • 186字
  • 2021-06-11 18:37:49

Summary

This chapter introduced us to the key technologies and concepts we can use to get started with reinforcement learning. The first two sections described two OpenAI Tools, OpenAI Gym and OpenAI Universe. These are collections that contain a large number of control problems that cover a broad spectrum of contexts, from classic tasks to video games, from browser usage to algorithm deduction. We learned how the interfaces of these environments are formalized, how to interact with them, and how to create a custom environment for a specific problem. Then, we learned how to build a policy network with TensorFlow, how to feed it with environment states to retrieve corresponding actions, and how to save the policy network weights. We also studied another OpenAI resource, Baselines. We solved problems that demonstrated how to train a reinforcement learning agent to solve a classic control task. Finally, using all the elements introduced in this chapter, we built an agent and trained it to play a classic Atari video game, thus achieving better-than-human performance.

In the next chapter, we will be delving deep into dynamic programming for reinforcement learning.

主站蜘蛛池模板: 广西| 崇左市| 和静县| 泰顺县| 广西| 太仓市| 大悟县| 安塞县| 厦门市| 榆社县| 永兴县| 积石山| 怀化市| 商南县| 武川县| 洛阳市| 苍梧县| 皋兰县| 弥渡县| 东海县| 长岛县| 奈曼旗| 和林格尔县| 驻马店市| 临沧市| 成武县| 兰考县| 弋阳县| 广水市| 旌德县| 昔阳县| 昌吉市| 长葛市| 清新县| 亳州市| 扎鲁特旗| 广灵县| 古蔺县| 胶州市| 辽阳县| 桓仁|