
Summary

In this chapter, we covered the building blocks of deep learning: shallow and deep neural networks, including logistic regression, single-hidden-layer neural networks, RNNs, LSTMs, CNNs, and their variations. Alongside these topics, we covered several activation functions, how forward and backward propagation work, and the problems associated with training deep neural networks, such as vanishing and exploding gradients.
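As a reminder of why gradients can vanish, the sketch below multiplies sigmoid derivatives across layers, as backpropagation does through the chain rule. It is a simplified illustration, not the chapter's own code: the function names are hypothetical, and it assumes one unit per layer with a weight of 1, so only the activation derivatives contribute.

```python
import math

def sigmoid(x):
    """Logistic sigmoid activation."""
    return 1.0 / (1.0 + math.exp(-x))

def sigmoid_grad(x):
    """Derivative of the sigmoid; its maximum value is 0.25, reached at x = 0."""
    s = sigmoid(x)
    return s * (1.0 - s)

def chained_gradient(pre_activations):
    """Product of sigmoid derivatives along a chain of layers.

    Assumption: every weight is 1, so the result isolates the effect
    of the activation function on the backpropagated gradient.
    """
    grad = 1.0
    for z in pre_activations:
        grad *= sigmoid_grad(z)
    return grad

# Even in the best case (all pre-activations at 0), the factor shrinks
# geometrically with depth: 0.25 per layer.
shallow = chained_gradient([0.0] * 2)
deep = chained_gradient([0.0] * 10)
```

With weights larger than 1 the same product can grow instead of shrink, which corresponds to the exploding-gradient case.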

Then, we covered the basic terminology of reinforcement learning, which we will explore in detail in the coming chapters. This included the optimality criteria, namely the value function and the policy. We also gained an understanding of some reinforcement learning algorithms, such as Q-learning and A3C. Finally, we covered some basic computations in the TensorFlow framework, introduced OpenAI Gym, and discussed some of the influential pioneers and research breakthroughs in the field of reinforcement learning.
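To make the Q-learning recap concrete, here is a minimal sketch of a single tabular Q-learning update. The function name, the nested-list Q-table, and the default values of `alpha` (learning rate) and `gamma` (discount factor) are assumptions chosen for illustration, not values taken from the chapter.

```python
def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.99):
    """Apply one tabular Q-learning update in place and return the new value.

    q is a list of lists indexed as q[state][action] (an assumed layout).
    """
    # Greedy estimate of the value of the next state.
    best_next = max(q[next_state])
    # Temporal-difference target: immediate reward plus discounted future value.
    td_target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]

# Two states, two actions; state 1 already has some learned values.
q_table = [[0.0, 0.0], [1.0, 2.0]]
new_value = q_update(q_table, state=0, action=1, reward=1.0,
                     next_state=1, alpha=0.5, gamma=0.9)
```

The greedy policy implied by the table picks, in each state, the action with the largest Q-value, which is how the value function and the policy relate in this setting.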

In the following chapter, we will apply a basic reinforcement learning algorithm to a couple of OpenAI Gym environments and get a better understanding of OpenAI Gym.
