官术网_书友最值得收藏!

Reinforcement learning algorithms

As we have seen in the previous sections, reinforcement learning is a programming technique that aims to develop algorithms that can learn and adapt to changes in the environment. This programming technique is based on the assumption of the agent being able to receive stimuli from the outside and to change its actions according to these stimuli. So, a correct choice will result in a reward while an incorrect choice will lead to a penalization of the system.

The goal of the system is to achieve the highest possible reward and consequently the best possible result. This result can be obtained through two approaches:

  • The first approach involves evaluating the choices of the algorithm and then rewarding or punishing the algorithm based on the result. These techniques can also adapt to substantial changes in the environment. An example is the image recognition programs that improve their performance with use. In this case we can say that learning takes place continuously.
  • In the second approach, a first phase is applied in which the algorithm is previously trained, and when the system is considered reliable, it is crystallized and no longer modifiable. This derives from the observation that constantly evaluating the actions of the algorithm can be a process that cannot be automated or that is very expensive.

These are only implementation choices, so it may happen that an algorithm includes the newly analyzed approaches.

So far, we have introduced the basic concepts of reinforcement learning. Now, we can analyze the various ways in which these concepts have been transformed into algorithms. In this section, we will list them, providing an overview, and we will deepen them in the practical cases that we will address in the following chapters.

主站蜘蛛池模板: 巫溪县| 手游| 沈阳市| 大余县| 南京市| 礼泉县| 乐清市| 布拖县| 宁城县| 盐亭县| 滦南县| 枞阳县| 富顺县| 苗栗市| 八宿县| 衡阳县| 垣曲县| 韶山市| 墨脱县| 台中县| 遵化市| 同心县| 高陵县| 白水县| 邵武市| 汨罗市| 泸溪县| 赫章县| 乌恰县| 壶关县| 怀宁县| 宁安市| 仙居县| 怀宁县| 岢岚县| 深泽县| 礼泉县| 石渠县| 鹿邑县| 于都县| 阜新市|