官术网_书友最值得收藏!

Learning curves

Another method of assessing a model's performance is by evaluating the model's growth of learning or the model's ability to improve learning (obtain a better score) with additional experience (for example, more rounds of cross-validation).

Note

Learning is the act of acquiring new, or modifying and reinforcing existing, knowledge.

The information indicating a model's result or score with a data file population can be combined with other scores to show a line or curve, which is known as a model's learning curve.

A learning curve is a graphical representation of the growth of learning (the scores shown in a vertical axis) with practice (the individual data files or rounds shown in the horizontal axis).

This can also be conceptualized as:

  • The same task repeated in a series
  • A body of knowledge learned over time

The following figure illustrates a hypothetical learning curve, showing the improved learning of a predictive model using resultant scores by cross-validation round:

Learning curves

Source link: https://en.wikipedia.org/wiki/File:Alanf777_Lcd_fig01.png

Tip

It's funny; one might know that the familiar expression it's a steep learning curve is intended to describe an activity that is tough to learn, but in statistics, a learning curve with a steep start would actually represent a rapidly improving progress.

Learning curves relating model performance to experience are commonly found to be used when performing model assessments.

As we have mentioned earlier in this section, performance (or the scores) is meant to be the accuracy of a model while experience (or round) may be the number of training examples, datasets, or iterations used in optimizing the model parameters.

Plot and ping

Using two generic R functions, we can demonstrate a simple learning curve visualization. Ping will open an image file which will hold our learning curve visualization so we can easily include it in a document later, and plot will draw our graphic.

The following are our example R code statements:

# -- 5 rounds of numeric test scores saved in a vector named "v"
v <-c(74,79, 88, 90, 99)

# -- create an image file for the visualization for later use
png(file = "c:/simple example/learning curve.png", type = c("windows", "cairo", "cairo-png"))

# -- plot the model scores round by round
plot(v, type = "o", col = "red", xlab = "Round", ylab = "Score", main = "Learning Curve")

# -- close output 
dev.off()

The preceding statements create the following graphic as a file:

Plot and ping
主站蜘蛛池模板: 吉安县| 剑阁县| 屯留县| 许昌市| 陈巴尔虎旗| 常山县| 大同县| 巴塘县| 进贤县| 万山特区| 如东县| 筠连县| 铁岭县| 新田县| 平陆县| 浦北县| 曲周县| 漳平市| 公主岭市| 河源市| 阳原县| 肥西县| 资阳市| 玛沁县| 临洮县| 专栏| 大同县| 夏河县| 留坝县| 甘泉县| 青川县| 金坛市| 喀喇| 陆丰市| 北安市| 咸阳市| 安塞县| 忻城县| 大同市| 平南县| 开化县|