官术网_书友最值得收藏!

  • Deep Learning By Example
  • Ahmed Menshawy
  • 216字
  • 2021-06-24 18:52:43

Generalization/true error

This is the second and more important type of error in data science. The whole purpose of building learning systems is the ability to get a smaller generalization error on the test set; in other words, to get the model to work well on a set of observation/samples that haven't been used in the training phase. If you still consider the class scenario from the previous section, you can think of generalization error as the ability to solve exam problems that weren’t necessarily similar to the problems you solved in the classroom to learn and get familiar with the subject. So, generalization performance is the model's ability to use the skills (parameters) that it learned in the training phase in order to correctly predict the outcome/output of unseen data.

In Figure 13, the light blue line represents the generalization error. You can see that as you increase the model complexity, the generalization error will be reduced, until some point when the model will start to lose its increasing power and the generalization error will decrease. This part of the curve where you get the generalization error to lose its increasing generalization power, is called overfitting.

The takeaway message from this section is to minimize the generalization error as much as you can.

主站蜘蛛池模板: 龙州县| 兰州市| 馆陶县| 娱乐| 扎赉特旗| 武陟县| 万宁市| 连山| 连山| 苗栗县| 抚顺市| 定远县| 微山县| 弋阳县| 肇源县| 泰来县| 宁武县| 元谋县| 华容县| 雷波县| 陆川县| 临沭县| 景宁| 姜堰市| 阿合奇县| 眉山市| 菏泽市| 渭南市| 阜宁县| 株洲市| 泽州县| 揭阳市| 门头沟区| 清远市| 阳谷县| 滦南县| 济宁市| 陵川县| 开化县| 如东县| 城口县|