- Hands-On Deep Learning Architectures with Python
- Yuxi (Hayden) Liu Saransh Mehta
- 209字
- 2021-06-24 14:48:09
Reinforcement learning
Remember how you learned to ride a bicycle in your childhood? It was a trial and error process, right? You tried to balance yourself, and each time you did something wrong, you tipped off the bicycle. But, you learned from your mistakes, and eventually, you were able to ride without falling. In the same way, Reinforcement learning does the same! An agent is exposed to an environment where it takes action from a list of possible actions, which leads to a change in the state of the agent. A state is the current situation of the environment the agent is in. For every action, the agent receives an award. Whenever the received reward is positive, it signifies the agent has taken the correct step, and when the reward is negative, it signifies a mistake. The agent follows a policy, a reinforcement learning algorithm through which the agent determines next actions considering the current state. Reinforcement learning is the true form of artificial intelligence, inspired by a human's way of learning through trial and error. Think of yourself as the agent and the bicycle the environment! Discussing reinforcement learning algorithms here is beyond the scope of this book, so let's shift focus back to deep learning!
- 大數(shù)據(jù)戰(zhàn)爭:人工智能時代不能不說的事
- 21天學通JavaScript
- 大數(shù)據(jù)專業(yè)英語
- 機器學習與大數(shù)據(jù)技術(shù)
- Apache Superset Quick Start Guide
- 新編計算機組裝與維修
- 聊天機器人:入門、進階與實戰(zhàn)
- 格蠹匯編
- 基于敏捷開發(fā)的數(shù)據(jù)結(jié)構(gòu)研究
- 手機游戲策劃設(shè)計
- 工業(yè)機器人集成應(yīng)用
- 無人駕駛感知智能
- Hands-On Deep Learning with Go
- 樂高創(chuàng)意機器人教程(中級 上冊 10~16歲) (青少年iCAN+創(chuàng)新創(chuàng)意實踐指導叢書)
- QTP自動化測試實踐