- Python Reinforcement Learning Projects
- Sean Saito Yang Wenzhuo Rajalingappaa Shanmugamani
- 308字
- 2021-07-23 19:05:00
Policy
A policy is an algorithm or a set of rules that describe how an agent makes its decisions. An example policy can be the strategy an investor uses to trade stocks, where the investor buys a stock when its price goes down and sells the stock when the price goes up.
More formally, a policy is a function, usually denoted as , that maps a state,
, to an action,
:
This means that an agent decides its action given its current state. This function can represent anything, as long as it can receive a state as input and output an action, be it a table, graph, or machine learning classifier.
For example, suppose we have an agent that is supposed to navigate a maze. We shall further assume that the agent knows what the maze looks like; the following is how the agent's policy can be represented:
Each white square in this maze represents a state the agent can be in. Each blue arrow refers to the action an agent would take in the corresponding square. This essentially represents the agent's policy for this maze. Moreover, this can also be regarded as a deterministic policy, for the mapping from the state to the action is deterministic. This is in contrast to a stochastic policy, where a policy would output a probability distribution over the possible actions given some state:
Here,is a normalized probability vector over all the possible actions, as shown in the following example:
The agent playing the game of Breakout has a policy that takes the screen of the game as input and returns a probability for each possible action.
- Big Data Analytics with Hadoop 3
- 現代測控電子技術
- Word 2000、Excel 2000、PowerPoint 2000上機指導與練習
- 輕輕松松自動化測試
- AutoCAD繪圖實用速查通典
- Hands-On Artificial Intelligence on Amazon Web Services
- JavaScript實例自學手冊
- 輕松學Java
- Learning Apache Cassandra(Second Edition)
- Visual C# 2008開發技術詳解
- 基于ARM 32位高速嵌入式微控制器
- Moodle Course Design Best Practices
- Implementing AWS:Design,Build,and Manage your Infrastructure
- Docker on Amazon Web Services
- LAMP網站開發黃金組合Linux+Apache+MySQL+PHP