官术网_书友最值得收藏!

Reinforcement learning algorithms

As we have seen in the previous sections, reinforcement learning is a programming technique that aims to develop algorithms that can learn and adapt to changes in the environment. This programming technique is based on the assumption of the agent being able to receive stimuli from the outside and to change its actions according to these stimuli. So, a correct choice will result in a reward while an incorrect choice will lead to a penalization of the system.

The goal of the system is to achieve the highest possible reward and consequently the best possible result. This result can be obtained through two approaches:

  • The first approach involves evaluating the choices of the algorithm and then rewarding or punishing the algorithm based on the result. These techniques can also adapt to substantial changes in the environment. An example is the image recognition programs that improve their performance with use. In this case we can say that learning takes place continuously.
  • In the second approach, a first phase is applied in which the algorithm is previously trained, and when the system is considered reliable, it is crystallized and no longer modifiable. This derives from the observation that constantly evaluating the actions of the algorithm can be a process that cannot be automated or that is very expensive.

These are only implementation choices, so it may happen that an algorithm includes the newly analyzed approaches.

So far, we have introduced the basic concepts of reinforcement learning. Now, we can analyze the various ways in which these concepts have been transformed into algorithms. In this section, we will list them, providing an overview, and we will deepen them in the practical cases that we will address in the following chapters.

主站蜘蛛池模板: 莱西市| 施秉县| 同心县| 抚宁县| 石景山区| 屏南县| 化州市| 鹰潭市| 新疆| 慈利县| 竹山县| 崇文区| 无棣县| 静乐县| 邻水| 平凉市| 沁源县| 水城县| 苍梧县| 杭锦旗| 桂阳县| 天峨县| 宜川县| 浦江县| 太原市| 兴隆县| 宜春市| 饶阳县| 平度市| 宁安市| 晋江市| 文山县| 那曲县| 高要市| 前郭尔| 安福县| 古浪县| 安顺市| 康乐县| 盘锦市| 靖远县|