官术网_书友最值得收藏!

2. Markov Decision Processes and Bellman Equations

Overview

This chapter will cover more of the theory behind reinforcement learning. We will cover Markov chains, Markov reward processes, and Markov decision processes. We will learn about the concepts of state values and action values along with Bellman equations to calculate previous quantities. By the end of this chapter, you will be able to solve Markov decision processes using linear programming methods.

主站蜘蛛池模板: 当阳市| 新源县| 北海市| 左云县| 彭山县| 山阳县| 深泽县| 昂仁县| 鸡西市| 龙泉市| 德州市| 辽宁省| 长乐市| 沂水县| 宿州市| 大厂| 潮州市| 专栏| 古交市| 祁门县| 万源市| 三明市| 蒲江县| 武义县| 京山县| 黄浦区| 仁寿县| 滨海县| 霍山县| 万载县| 乐安县| 舒兰市| 濮阳县| 汉阴县| 姚安县| 监利县| 义乌市| 青冈县| 乌海市| 武安市| 旅游|