- Intelligent Projects Using Python
- Santanu Pattanayak
- 2021-07-02 14:10:45
Deep Q-learning
In Q-learning, we generally work with a finite set of states and actions, which means that tables suffice to hold the Q-values and rewards. In practical applications, however, the number of states and applicable actions is often very large or effectively infinite, so a table is no longer feasible and better function approximators are needed to represent and learn the Q-function. This is where deep neural networks come to the rescue, since they are universal function approximators. We can represent the Q-function with a neural network that takes a state and an action as input and provides the corresponding Q-value as output. Alternatively, we can train a neural network that takes only the state as input and outputs the Q-values for all of the actions at once. Both of these scenarios are illustrated in the following diagram. Since the Q-values are real-valued estimates of expected cumulative reward, we are dealing with regression in these networks.

In this book, we will use deep Q-learning, a form of reinforcement learning, to train a race car to drive by itself.
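The two network layouts described above can be sketched in plain Python as follows. This is only a minimal illustration, not the book's implementation: the sizes `STATE_DIM`, `NUM_ACTIONS`, and `HIDDEN`, the helper names, and the one-hot encoding of the action are all assumptions made for the example, and the weights are random and untrained, so the Q-values carry no meaning yet.

```python
import random

random.seed(0)

# Hypothetical sizes chosen only for illustration.
STATE_DIM = 4      # length of the state vector
NUM_ACTIONS = 3    # number of discrete actions
HIDDEN = 8         # hidden units in each network


def init_layer(n_in, n_out):
    """Create a dense layer as (weights, biases) with small random weights."""
    weights = [[random.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer, relu=False):
    """Apply a dense layer to input list x, optionally followed by ReLU."""
    weights, biases = layer
    out = [sum(w * xi for w, xi in zip(row, x)) + b
           for row, b in zip(weights, biases)]
    return [max(0.0, v) for v in out] if relu else out


def one_hot(action, n):
    """Encode a discrete action index as a one-hot list of length n."""
    return [1.0 if i == action else 0.0 for i in range(n)]


# Layout 1: the network takes (state, action) and returns a single Q-value.
net_sa = [init_layer(STATE_DIM + NUM_ACTIONS, HIDDEN), init_layer(HIDDEN, 1)]


def q_state_action(state, action):
    x = list(state) + one_hot(action, NUM_ACTIONS)
    hidden = dense(x, net_sa[0], relu=True)
    return dense(hidden, net_sa[1])[0]  # linear output, since this is regression


# Layout 2: the network takes only the state and returns one Q-value per action.
net_s = [init_layer(STATE_DIM, HIDDEN), init_layer(HIDDEN, NUM_ACTIONS)]


def q_all_actions(state):
    hidden = dense(state, net_s[0], relu=True)
    return dense(hidden, net_s[1])  # linear outputs, one per action


state = [0.1, -0.2, 0.3, 0.0]
q_single = q_state_action(state, action=1)
q_vector = q_all_actions(state)

# With layout 2, the greedy action needs only one forward pass.
greedy_action = max(range(NUM_ACTIONS), key=lambda a: q_vector[a])
```

With the first layout, choosing the best action requires one forward pass per candidate action, whereas the second layout yields all of the Q-values in a single pass, which is convenient when the action set is small and discrete.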