Value function

A value function describes how good it is for an agent to be in a particular state. It depends on the policy and is often denoted by v(s). It is equal to the total expected reward the agent receives when it starts from state s and follows that policy thereafter. Each policy has its own value function; the optimal value function is the one whose value in every state is at least as high as that of any other value function. Similarly, an optimal policy is one whose value function is the optimal value function.
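To make the dependence of v(s) on the policy concrete, here is a minimal sketch that evaluates two policies on a tiny, made-up environment. The three-state chain, the action names `right` and `stay`, the reward of 1 for reaching the terminal state, and the discount factor `GAMMA = 0.9` are all assumptions chosen for illustration; none of them come from the text above. The evaluation loop is ordinary iterative policy evaluation, shown here only as one possible way to compute a value function.

```python
# Hypothetical toy environment, invented for illustration:
# states 0 and 1 are non-terminal, state 2 is terminal.
GAMMA = 0.9  # discount factor (an assumption; the text does not specify one)
STATES = [0, 1, 2]
TERMINAL = 2


def step(state, action):
    """Deterministic transition: return (next_state, reward)."""
    if action == "right":
        next_state = state + 1
        reward = 1.0 if next_state == TERMINAL else 0.0
        return next_state, reward
    return state, 0.0  # "stay": remain in place, no reward


def evaluate_policy(policy, tol=1e-10, max_sweeps=10_000):
    """Iterative policy evaluation: repeat v(s) <- r + GAMMA * v(s')."""
    v = {s: 0.0 for s in STATES}  # the terminal state keeps value 0
    for _ in range(max_sweeps):
        delta = 0.0
        for s in STATES:
            if s == TERMINAL:
                continue
            next_state, reward = step(s, policy[s])
            new_value = reward + GAMMA * v[next_state]
            delta = max(delta, abs(new_value - v[s]))
            v[s] = new_value
        if delta < tol:  # stop once no value changed noticeably
            break
    return v


# Two policies, each mapping a non-terminal state to an action.
always_right = {0: "right", 1: "right"}
always_stay = {0: "stay", 1: "stay"}

v_right = evaluate_policy(always_right)
v_stay = evaluate_policy(always_stay)
print("always_right:", v_right)
print("always_stay: ", v_stay)
```

Under these assumptions, the `always_right` policy gives v(0) = 0.9 and v(1) = 1.0, while `always_stay` gives 0 in every state. The first value function is at least as high in every state, so in this toy setting it is the optimal value function and `always_right` is an optimal policy.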
