官术网_书友最值得收藏!

Reinforcement learning

Reinforcement learning aims to create algorithms that can learn and adapt to environmental changes. This programming technique is based on the concept of receiving external stimuli, the nature of which depends on the algorithm choices. A correct choice will involve a reward, while an incorrect choice will lead to a penalty. The goal of the system is to achieve the best possible result, of course.

In supervised learning, there is a teacher that tells the system the correct output (learning with a teacher). This is not always possible. Often, we have only qualitative information (sometimes binary, right/wrong, or success/failure).

The information available is called reinforcement signals. But the system does not give any information on how to update the agent's behavior (that is, weights). You cannot define a cost function or a gradient. The goal of the system is to create smart agents that have machinery able to learn from their experience.

主站蜘蛛池模板: 青冈县| 镇原县| 工布江达县| 台江县| 兰溪市| 南康市| 沙洋县| 南安市| 梁平县| 大姚县| 翁牛特旗| 噶尔县| 怀柔区| 根河市| 根河市| 田东县| 公安县| 阳信县| 大埔县| 平乐县| 绥德县| 新化县| 宜黄县| 嘉定区| 都匀市| 同江市| 富宁县| 合肥市| 合水县| 东山县| 新余市| 肃宁县| 岳阳县| 东辽县| 阳西县| 手机| 夏津县| 榕江县| 奎屯市| 寿宁县| 古丈县|