官术网_书友最值得收藏!

Reinforcement learning

The last type of learning in machine learning is reinforcement learning, in which there's no supervisor but only a reward signal.

So the reinforcement learning algorithm will try to make a decision and then a reward signal will be there to tell whether this decision is right or wrong. Also, this supervision feedback or reward signal may not come instantaneously but get delayed for a few steps. For example, the algorithm will take a decision now, but only after many steps will the reward signal tell whether decision was good or bad.

主站蜘蛛池模板: 长汀县| 和硕县| 饶平县| 寻甸| 延安市| 高陵县| 嘉祥县| 长宁县| 漳平市| 临漳县| 鹤庆县| 泰宁县| 通海县| 凉山| 龙江县| 太康县| 囊谦县| 镇宁| 乐至县| 泸水县| 长阳| 依安县| 高密市| 什邡市| 修文县| 禄丰县| 柘城县| 玉树县| 宽甸| 彭泽县| 昌乐县| 盐津县| 缙云县| 阿鲁科尔沁旗| 临湘市| 潮州市| 台湾省| 昭通市| 广丰县| 芜湖市| 吉林省|