官术网_书友最值得收藏!

Understanding SARSA and Q-Learning 

In this section, we will learn about SARSA and Q-Learning and how can they are coded with Python. Before we go further, let's find out what SARSA and Q-Learning are. SARSA is an algorithm that uses the state-action Q values to update. These concepts are derived from the computer science field of dynamic programming, while Q-learning is an off-policy algorithm that was first proposed by Christopher Watkins in 1989, and is a widely used RL algorithm. 

主站蜘蛛池模板: 莱州市| 柳州市| 敦化市| 铁力市| 合作市| 来宾市| 河间市| 潼南县| 博客| 乐业县| 密山市| 衡山县| 扶绥县| 龙门县| 长沙市| 东乡族自治县| 胶州市| 巴南区| 婺源县| 壤塘县| 渭南市| 马关县| 澄江县| 彩票| 宕昌县| 乌兰察布市| 尚志市| 曲松县| 东明县| 平南县| 县级市| 宁津县| 鱼台县| 朝阳县| 鄯善县| 潜江市| 根河市| 工布江达县| 长乐市| 山丹县| 霍邱县|