- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 112字
- 2021-06-24 15:17:21
RL algorithm
The steps involved in typical RL algorithm are as follows:
- First, the agent interacts with the environment by performing an action
- The agent performs an action and moves from one state to another
- And then the agent will receive a reward based on the action it performed
- Based on the reward, the agent will understand whether the action was good or bad
- If the action was good, that is, if the agent received a positive reward, then the agent will prefer performing that action or else the agent will try performing an other action which results in a positive reward. So it is basically a trial and error learning process
推薦閱讀
- 數(shù)據(jù)浪潮
- 在你身邊為你設(shè)計(jì)Ⅲ:騰訊服務(wù)設(shè)計(jì)思維與實(shí)戰(zhàn)
- ETL數(shù)據(jù)整合與處理(Kettle)
- Libgdx Cross/platform Game Development Cookbook
- MySQL從入門到精通(第3版)
- 高維數(shù)據(jù)分析預(yù)處理技術(shù)
- 智慧的云計(jì)算
- 數(shù)據(jù)庫應(yīng)用系統(tǒng)開發(fā)實(shí)例
- Mastering LOB Development for Silverlight 5:A Case Study in Action
- Visual Studio 2013 and .NET 4.5 Expert Cookbook
- Mastering ROS for Robotics Programming(Second Edition)
- Hands-On System Programming with C++
- 大數(shù)據(jù)時(shí)代系列(套裝9冊(cè))
- PostgreSQL高可用實(shí)戰(zhàn)
- NoSQL數(shù)據(jù)庫原理(第2版·微課版)