
Reinforcement learning

Reinforcement learning is special in that it doesn't require a dataset prepared in advance (see the following diagram). Instead, it involves an agent that takes actions, which change the state of the environment. After each step, the agent receives a reward or a punishment, depending on the state and its previous actions. The goal is to maximize the cumulative reward. It can be used to teach a computer to play video games or drive a car. If you think about it, reinforcement learning is the way our pets train us humans: by rewarding our actions with tail-wagging, or punishing them with scratched furniture.

One of the central topics in reinforcement learning is the exploration-exploitation dilemma—how to find a good balance between exploring new options and using what is already known:

Figure 1.3: Reinforcement learning process
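
To make the loop and the exploration-exploitation trade-off more concrete, here is a minimal sketch in Python. It is not taken from the book: it assumes a toy "multi-armed bandit" environment with no state, where each action pays a noisy reward, and it uses the epsilon-greedy rule as one common way to balance exploring and exploiting. The function name `run_bandit` and its parameters (`true_means`, `epsilon`, `noise`, `seed`) are hypothetical names chosen for illustration.

```python
import random


def run_bandit(true_means, steps=2000, epsilon=0.1, noise=1.0, seed=0):
    """Epsilon-greedy agent on a toy multi-armed bandit (illustrative only).

    true_means: hidden average reward of each action (unknown to the agent).
    epsilon:    probability of exploring a random action at each step.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how many times each action was taken
    estimates = [0.0] * n_arms   # agent's running estimate of each action's reward
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random action to learn more about it.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: take the action that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # The environment answers with a noisy reward.
        reward = rng.gauss(true_means[arm], noise)

        # Update the running average for the chosen action.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    estimates, counts, total = run_bandit([0.0, 5.0])
    print("estimated rewards:", estimates)
    print("times each action was taken:", counts)
    print("cumulative reward:", total)
```

With `epsilon=0` the agent never explores and can stay stuck on the first action it tried; with `epsilon=1` it explores all the time and never uses what it has learned. Values in between trade some short-term reward for better knowledge of the environment.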

Table 1.3: ML tasks:
