官术网_书友最值得收藏!

Summary

In this chapter, we covered the building blocks, such as shallow and deep neural networks that included logistic regression, single hidden layer neural network, RNNs, LSTMs, CNNs, and their other variations. Catering to the these topics, we also covered multiple activation functions, how forward and backward propagation works, and the problems associated with the training of deep neural networks, such as vanishing and exploding gradients.

Then, we covered the very basic terminologies in reinforcement learning that we will explore in detail in the coming chapters. These were the optimality criteria, which are value function and policy. We also gained an understanding of some reinforcement learning algorithms, such as Q-learning and A3C algorithms. Then, we covered some basic computations in the TensorFlow framework, an introduction to OpenAI Gym, and also discussed some of the influential pioneers and research breakthroughs in the field of reinforcement learning.

In the following chapter, we will implement a basic reinforcement learning algorithm to a couple of OpenAI Gym framework environments and get a better understanding of OpenAI Gym.

主站蜘蛛池模板: 西贡区| 健康| 彰化市| 潼南县| 马龙县| 香港| 宁化县| 潢川县| 遂宁市| 辉县市| 永兴县| 新昌县| 恩平市| 诏安县| 英超| 远安县| 肇源县| 临泽县| 江门市| 莫力| 五指山市| 徐汇区| 洱源县| 桐城市| 抚宁县| 临桂县| 旅游| 毕节市| 兴安盟| 淅川县| 伊宁市| 长岛县| 元谋县| 德令哈市| 民权县| 五华县| 延长县| 建德市| 古丈县| 武山县| 临城县|