官术网_书友最值得收藏!

Reinforcement learning

Reinforcement learning is special in the sense that it doesn't require a dataset (see the following diagram). Instead, it involves an agent who takes actions, changing the state of the environment. After each step, it gets a reward or punishment, depending on the state and previous actions. The goal is to obtain a maximum cumulative reward. It can be used to teach the computer to play video games or drive a car. If you think about it, reinforcement learning is the way our pets train us humans: by rewarding our actions with tail-wagging, or punishing with scratched furniture.

One of the central topics in reinforcement learning is the exploration-exploitation dilemma—how to find a good balance between exploring new options and using what is already known:

Figure 1.3: Reinforcement learning process

Table 1.3: ML tasks:

主站蜘蛛池模板: 花垣县| 会昌县| 都兰县| 浑源县| 都江堰市| 松原市| 濮阳市| 莱西市| 塔城市| 磐安县| 上饶县| 望谟县| 古丈县| 鹤壁市| 舟山市| 芒康县| 隆回县| 利津县| 余姚市| 航空| 黑河市| 侯马市| 龙泉市| 沙雅县| 维西| 色达县| 略阳县| 建瓯市| 襄垣县| 遂溪县| 通榆县| 高邮市| 华亭县| 安平县| 平武县| 朔州市| 德庆县| 霍邱县| 民乐县| 抚松县| 治县。|