官术网_书友最值得收藏!

Reinforcement learning

Reinforcement learning aims to create algorithms that can learn and adapt to environmental changes. This programming technique is based on the concept of receiving external stimuli, the nature of which depends on the algorithm choices. A correct choice will involve a reward, while an incorrect choice will lead to a penalty. The goal of the system is to achieve the best possible result, of course.

In supervised learning, there is a teacher that tells the system the correct output (learning with a teacher). This is not always possible. Often, we have only qualitative information (sometimes binary, right/wrong, or success/failure).

The information available is called reinforcement signals. But the system does not give any information on how to update the agent's behavior (that is, weights). You cannot define a cost function or a gradient. The goal of the system is to create smart agents that have machinery able to learn from their experience.

主站蜘蛛池模板: 伊春市| 遵义县| 江北区| 荃湾区| 盐边县| 宁明县| 南雄市| 金门县| 磴口县| 略阳县| 九龙坡区| 万年县| 乌兰浩特市| 临城县| 灵山县| 资阳市| 莱阳市| 伽师县| 长沙市| 中宁县| 仙游县| 瑞丽市| 拜泉县| 徐闻县| 鹤庆县| 赤城县| 许昌市| 潞西市| 千阳县| 秀山| 恩平市| 三明市| 沅江市| 儋州市| 岳西县| 平潭县| 遂昌县| 临沂市| 桃源县| 吉林省| 西盟|