官术网_书友最值得收藏!

Summary

In this chapter, we started our journey into the RL world by learning what makes RL special and how it relates to the supervised and unsupervised learning paradigm. We then learned about the basic RL formalisms and how they interact with each other, after which we defined Markov process, Markov reward process, and Markov decision process.

In the next chapter, we'll move away from the formal theory into the practice of RL. We'll cover the setup required, libraries, and write our first agent.

主站蜘蛛池模板: 丰宁| 偃师市| 自贡市| 荆州市| 承德市| 昌乐县| 峡江县| 金溪县| 项城市| 天峨县| 开江县| 民勤县| 琼中| 六安市| 栖霞市| 连城县| 诸暨市| 通榆县| 额尔古纳市| 邵阳县| 崇仁县| 南宫市| 夏河县| 浏阳市| 句容市| 甘泉县| 个旧市| 美姑县| 潮州市| 吉安县| 宁晋县| 错那县| 龙泉市| 酒泉市| 东阳市| 北海市| 波密县| 静宁县| 抚顺县| 通山县| 陕西省|