官术网_书友最值得收藏!

Reinforcement learning

The last type of learning in machine learning is reinforcement learning, in which there's no supervisor but only a reward signal.

So the reinforcement learning algorithm will try to make a decision and then a reward signal will be there to tell whether this decision is right or wrong. Also, this supervision feedback or reward signal may not come instantaneously but get delayed for a few steps. For example, the algorithm will take a decision now, but only after many steps will the reward signal tell whether decision was good or bad.

主站蜘蛛池模板: 宣武区| 灵宝市| 沅陵县| 潍坊市| 三门峡市| 绥阳县| 安宁市| 邵武市| 河东区| 河津市| 会宁县| 华安县| 潞城市| 黔东| 富源县| 南岸区| 界首市| 芷江| 海南省| 阿拉善左旗| 辽中县| 屯门区| 光山县| 托里县| 胶南市| 兰溪市| 客服| 缙云县| 偃师市| 毕节市| 双柏县| 新昌县| 绥棱县| 阜南县| 镇平县| 乳山市| 长泰县| 仙桃市| 淮阳县| 隆回县| 青龙|