官术网_书友最值得收藏!

Reinforcement learning

Remember how you learned to ride a bicycle in your childhood? It was a trial and error process, right? You tried to balance yourself, and each time you did something wrong, you tipped off the bicycle. But, you learned from your mistakes, and eventually, you were able to ride without falling. In the same way, Reinforcement learning does the same! An agent is exposed to an environment where it takes action from a list of possible actions, which leads to a change in the state of the agent. A state is the current situation of the environment the agent is in. For every action, the agent receives an award. Whenever the received reward is positive, it signifies the agent has taken the correct step, and when the reward is negative, it signifies a mistake. The agent follows a policy, a reinforcement learning algorithm through which the agent determines next actions considering the current state. Reinforcement learning is the true form of artificial intelligence, inspired by a human's way of learning through trial and error. Think of yourself as the agent and the bicycle the environment! Discussing reinforcement learning algorithms here is beyond the scope of this book, so let's shift focus back to deep learning!

主站蜘蛛池模板: 土默特右旗| 天长市| 仁化县| 通江县| 宁德市| 河东区| 定南县| 三台县| 红桥区| 涡阳县| 双峰县| 满城县| 虹口区| 鄂州市| 沅江市| 浠水县| 珲春市| 信宜市| 罗定市| 许昌市| 涟源市| 永城市| 沂源县| 南京市| 温州市| 东港市| 乌鲁木齐县| 驻马店市| 招远市| 宣恩县| 洪雅县| 岳普湖县| 桐城市| 双江| 双流县| 浙江省| 屏东市| 合山市| 东山县| 葵青区| 诸城市|