官术网_书友最值得收藏!

Value function

A value function denotes how good it is for an agent to be in a particular state. It is dependent on the policy and is often denoted by v(s). It is equal to the total expected reward received by the agent starting from the initial state. There can be several value functions; the optimal value function is the one that has the highest value for all the states compared to other value functions. Similarly, an optimal policy is the one that has the optimal value function.

主站蜘蛛池模板: 绥芬河市| 富平县| 芒康县| 南华县| 名山县| 牡丹江市| 图片| 广水市| 澳门| 康马县| 靖边县| 尖扎县| 微山县| 乡城县| 宁武县| 瑞金市| 南澳县| 城固县| 汤原县| 清丰县| 重庆市| 视频| 永定县| 石台县| 吉安县| 陆丰市| 德安县| 临沂市| 曲阜市| 古田县| 冀州市| 本溪| 张家川| 屯门区| 江永县| 绥化市| 图木舒克市| 珲春市| 科技| 鄂托克旗| 新和县|