官术网_书友最值得收藏!

Summary

In this chapter, we started our journey into the RL world by learning what makes RL special and how it relates to the supervised and unsupervised learning paradigm. We then learned about the basic RL formalisms and how they interact with each other, after which we defined Markov process, Markov reward process, and Markov decision process.

In the next chapter, we'll move away from the formal theory into the practice of RL. We'll cover the setup required, libraries, and write our first agent.

主站蜘蛛池模板: 滦南县| 浦县| 大方县| 阿拉善右旗| 永川市| 宜春市| 增城市| 石狮市| 太谷县| 墨江| 屯门区| 大城县| 肥乡县| 普安县| 新绛县| 宁远县| 神池县| 平阴县| 肥城市| 广平县| 永平县| 泰顺县| 兴海县| 临安市| 香格里拉县| 秦皇岛市| 政和县| 竹山县| 雅江县| 遵义县| 岢岚县| 辛集市| 句容市| 塔城市| 呼伦贝尔市| 龙陵县| 阜康市| 中宁县| 墨江| 廉江市| 老河口市|