- MATLAB for Machine Learning
- Giuseppe Ciaburro
- 2021-07-02 19:37:33
Reinforcement learning
Reinforcement learning aims to create algorithms that can learn and adapt to environmental changes. This programming technique is based on the concept of receiving external stimuli that depend on the algorithm's choices. A correct choice earns a reward, while an incorrect choice incurs a penalty. The goal of the system is to achieve the best possible result.
In supervised learning, a teacher tells the system what the correct output is (learning with a teacher). This is not always possible. Often only qualitative information is available (sometimes binary: right/wrong or success/failure).
The information available is called the reinforcement signal. This signal, however, gives no information on how to update the agent's behavior (that is, its weights). You cannot define a cost function or a gradient. The goal is to create smart agents equipped with machinery that lets them learn from their experience.
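As a rough illustration of learning from a reinforcement signal alone, the following Python sketch lets an agent choose among three actions and receive only a success (+1) or failure (-1) in return. The function name `run_bandit`, the success probabilities, and the epsilon-greedy exploration rule are assumptions made for this example; they do not come from the book. The agent never sees the correct answer and computes no gradient: it only keeps a running average of the reward each action has earned.

```python
import random

def run_bandit(success_probs, steps=5000, epsilon=0.1, seed=0):
    """Learn action values from a binary reinforcement signal only.

    success_probs is a hypothetical environment: entry i is the chance
    that action i is judged a success. The agent never sees these numbers.
    """
    rng = random.Random(seed)
    n_actions = len(success_probs)
    values = [0.0] * n_actions  # running average reward per action
    counts = [0] * n_actions    # how many times each action was tried

    for _ in range(steps):
        # Explore with probability epsilon, otherwise pick the best estimate.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: values[a])

        # The environment answers only "success" (+1) or "failure" (-1).
        reward = 1.0 if rng.random() < success_probs[action] else -1.0

        # Incremental average of observed rewards: no cost function, no gradient.
        counts[action] += 1
        values[action] += (reward - values[action]) / counts[action]

    return values, counts

values, counts = run_bandit([0.2, 0.5, 0.8])
best = max(range(len(values)), key=lambda a: values[a])
print("estimated values:", [round(v, 2) for v in values])
print("times each action was tried:", counts)
print("action with the highest estimate:", best)
```

With enough steps, the estimate for each action should approach its average reward, so the agent ends up preferring the action that succeeds most often, even though it was only ever told whether each individual choice succeeded or failed.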
This flowchart shows reinforcement learning:
