官术网_书友最值得收藏!

Reinforcement learning

Remember how you learned to ride a bicycle in your childhood? It was a trial and error process, right? You tried to balance yourself, and each time you did something wrong, you tipped off the bicycle. But, you learned from your mistakes, and eventually, you were able to ride without falling. In the same way, Reinforcement learning does the same! An agent is exposed to an environment where it takes action from a list of possible actions, which leads to a change in the state of the agent. A state is the current situation of the environment the agent is in. For every action, the agent receives an award. Whenever the received reward is positive, it signifies the agent has taken the correct step, and when the reward is negative, it signifies a mistake. The agent follows a policy, a reinforcement learning algorithm through which the agent determines next actions considering the current state. Reinforcement learning is the true form of artificial intelligence, inspired by a human's way of learning through trial and error. Think of yourself as the agent and the bicycle the environment! Discussing reinforcement learning algorithms here is beyond the scope of this book, so let's shift focus back to deep learning!

主站蜘蛛池模板: 鸡泽县| 温泉县| 泌阳县| 通辽市| 西吉县| 泸西县| 水城县| 赣州市| 慈溪市| 垦利县| 越西县| 都江堰市| 镇康县| 教育| 永吉县| 甘孜| 叶城县| 剑河县| 东乌| 佛山市| 梅河口市| 六枝特区| 仲巴县| 新乡市| 白朗县| 驻马店市| 大邑县| 安岳县| 印江| 准格尔旗| 南江县| 和硕县| 正蓝旗| 沈阳市| 绥棱县| 仙游县| 宣汉县| 岗巴县| 毕节市| 西畴县| 蒙山县|