官术网_书友最值得收藏!

  • Python Reinforcement Learning
  • Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
  • 100字
  • 2021-06-24 15:17:22

Model

Model is the agent's representation of an environment. The learning can be of two types—model-based learning and model-free learning. In model-based learning, the agent exploits previously learned information to accomplish a task, whereas in model-free learning, the agent simply relies on a trial-and-error experience for performing the right action. Say you want to reach your office from home faster. In model-based learning, you simply use a previously learned experience (map) to reach the office faster, whereas in model-free learning you will not use a previous experience and will try all different routes and choose the faster one.

主站蜘蛛池模板: 根河市| 远安县| 长顺县| 卢湾区| 巴彦县| 巴南区| 娄烦县| 曲阜市| 墨竹工卡县| 晋宁县| 芮城县| 东至县| 清远市| 芦山县| 涟源市| 阳谷县| 鹤岗市| 正镶白旗| 宾阳县| 义马市| 黄冈市| 含山县| 商城县| 孝义市| 乌鲁木齐县| 万荣县| 潍坊市| 磴口县| 金秀| 普宁市| 长泰县| 庆城县| 茶陵县| 东辽县| 民勤县| 凤庆县| 乳山市| 同心县| 隆回县| 安国市| 绵竹市|