官术网_书友最值得收藏!

Reinforcement

In the reinforcement machine learning model, the agent is in interaction with its environment, so it learns from experience, by collecting data during the process; the goal is optimizing what we call a long term reward. You can view it as a game with a scoring system. The following graph illustrates a reinforcement model:

主站蜘蛛池模板: 偃师市| 资溪县| 遵义县| 广汉市| 都昌县| 安多县| 德惠市| 将乐县| 宝兴县| 彝良县| 西藏| 淅川县| 西充县| 天水市| 磐石市| 泰兴市| 通榆县| 江安县| 福海县| 张家口市| 乳山市| 监利县| 中牟县| 巴彦淖尔市| 云浮市| 建德市| 阿勒泰市| 临桂县| 巴彦县| 黄大仙区| 蒲城县| 新宁县| 布拖县| 仙居县| 科技| 无棣县| 奎屯市| 花莲市| 大田县| 常德市| 弥勒县|