
  • The Reinforcement Learning Workshop
  • Alessandro Palmas, Emanuele Ghelfi, Dr. Alexandra Galina Petre, Mayur Kulkarni, Anand N.S., Quan Nguyen, Aritra Sen, Anthony So, Saikat Basak
  • 2021-06-11 18:37:43

Summary

RL is one of the fundamental paradigms under the umbrella of machine learning. The principles of RL are very general and interdisciplinary, and they are not bound to a specific application.

RL studies how an agent interacts with an external environment, taking inspiration from the way humans learn. It explicitly addresses the need to explore efficiently and the exploration-exploitation trade-off that arises in almost all human decision problems, a feature that distinguishes it from other machine learning paradigms.
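
One common way to make the exploration-exploitation trade-off concrete is an epsilon-greedy rule on a multi-armed bandit. The sketch below is illustrative only and is not taken from the chapter; the function name, the Gaussian reward model, and all parameter values are assumptions chosen for the example.

```python
import random


def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=2000, seed=0):
    """Hypothetical helper: play a Gaussian bandit with an epsilon-greedy rule."""
    rng = random.Random(seed)      # local generator, so runs are reproducible
    n_arms = len(true_means)
    counts = [0] * n_arms          # how many times each arm was pulled
    estimates = [0.0] * n_arms     # running mean reward observed per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: unit-variance Gaussian noise around the arm's mean.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental update of the sample mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward
```

With `epsilon=0` the agent never explores and can lock onto a poor arm; with `epsilon=1` it never uses what it has learned. Intermediate values trade some immediate reward for better estimates.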

We started this chapter with a high-level description of RL and some interesting applications. We then introduced the main concepts of RL, describing what an agent is, what an environment is, and how an agent interacts with its environment. Finally, we used Gym and Baselines to implement working examples, showing how these libraries simplify RL experiments.
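
The agent-environment interaction recapped above can be sketched as a simple loop. The example below is a minimal, self-contained sketch that does not import Gym: `CorridorEnv` and `run_episode` are hypothetical names, and the environment only mirrors the classic Gym convention in which `reset()` returns an observation and `step(action)` returns `(observation, reward, done, info)`. Newer Gym and Gymnasium releases use a slightly different return signature, so treat this as an illustration of the idea rather than the library's API.

```python
class CorridorEnv:
    """Hypothetical toy environment: walk right along a corridor to reach the goal."""

    def __init__(self, length=5):
        self.length = length
        self.position = 0

    def reset(self):
        # Start every episode at the left end of the corridor.
        self.position = 0
        return self.position

    def step(self, action):
        # Action 1 moves right, any other action moves left; walls clip the position.
        move = 1 if action == 1 else -1
        self.position = min(max(self.position + move, 0), self.length - 1)
        done = self.position == self.length - 1
        # Small penalty per step, positive reward on reaching the goal.
        reward = 1.0 if done else -0.1
        return self.position, reward, done, {}


def run_episode(env, policy, max_steps=100):
    """Run one episode: the agent observes, acts, and receives a reward each step."""
    observation = env.reset()
    total_reward = 0.0
    steps = 0
    while steps < max_steps:
        action = policy(observation)                     # agent chooses an action
        observation, reward, done, _ = env.step(action)  # environment responds
        total_reward += reward
        steps += 1
        if done:
            break
    return total_reward, steps
```

For instance, `run_episode(CorridorEnv(length=5), lambda obs: 1)` uses a policy that always moves right; it reaches the goal in 4 steps with a return of approximately 0.7.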

In the next chapter, we will learn more about the theory behind RL, starting with Markov chains and arriving at Markov decision processes (MDPs). We will present the two functions at the core of almost all RL algorithms: the state-value function, which evaluates how good a state is, and the action-value function, which evaluates the quality of a state-action pair.
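
As a preview, the two functions are commonly written as expected discounted returns. The notation here is an assumption based on the standard textbook convention and may differ from the next chapter's: a policy $\pi$, a discount factor $\gamma \in [0, 1)$, and random variables $S_t$, $A_t$, and $R_{t+1}$ for the state, action, and reward at time $t$.

```latex
% State-value function: expected return when starting in state s and following \pi
V^{\pi}(s) = \mathbb{E}_{\pi}\!\left[ \sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1} \,\middle|\, S_t = s \right]

% Action-value function: expected return when starting in s, taking action a, then following \pi
Q^{\pi}(s, a) = \mathbb{E}_{\pi}\!\left[ \sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1} \,\middle|\, S_t = s,\; A_t = a \right]

% The two are linked by averaging over the policy's action choices (discrete actions assumed)
V^{\pi}(s) = \sum_{a} \pi(a \mid s)\, Q^{\pi}(s, a)
```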
