Bagging

Bootstrap aggregating, or bagging, is an algorithm introduced by Leo Breiman in 1994 that applies bootstrapping to machine learning problems. Bootstrapping is a statistical procedure that creates new datasets from existing data by sampling with replacement. It can be used to analyze the range of values that the arithmetic mean, the variance, or another quantity can assume.
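To make bootstrapping concrete, the following is a minimal sketch that resamples a small dataset with replacement and looks at how the arithmetic mean varies across resamples. The function name `bootstrap_statistic`, the sample values, and the choice of 1,000 resamples are assumptions made for illustration only.

```python
import numpy as np

def bootstrap_statistic(data, statistic=np.mean, n_resamples=1000, seed=0):
    """Return the statistic computed on each bootstrap resample of data."""
    rng = np.random.default_rng(seed)
    data = np.asarray(data)
    n = len(data)
    estimates = np.empty(n_resamples)
    for i in range(n_resamples):
        # Draw n points from the data with replacement
        sample = rng.choice(data, size=n, replace=True)
        estimates[i] = statistic(sample)
    return estimates

# Made-up sample values, used only to demonstrate the procedure
data = np.array([2.1, 3.4, 1.9, 5.6, 4.2, 3.8, 2.7, 4.9])
means = bootstrap_statistic(data)

# A simple percentile interval for the mean
low, high = np.percentile(means, [2.5, 97.5])
print(f"Sample mean: {data.mean():.2f}")
print(f"Approximate 95% interval for the mean: [{low:.2f}, {high:.2f}]")
```

Passing `statistic=np.var` instead would give the same kind of picture for the variance.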

The algorithm aims to reduce the chance of overfitting with the following steps:

  1. We generate new training sets from the input training data by sampling with replacement.
  2. For each generated training set, we fit a new model.
  3. We combine the results of the models by averaging (for regression) or by majority voting (for classification).
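The three steps can be sketched in a few lines of NumPy. This is an illustrative sketch, not the implementation used later in the book: the base model `NearestCentroid`, the helper names `bagging_fit` and `bagging_predict`, and the synthetic two-cluster data are all assumptions chosen to keep the example self-contained. It also assumes class labels are non-negative integers.

```python
import numpy as np

class NearestCentroid:
    """Toy base model (hypothetical): predicts the class with the closest mean."""
    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.centroids_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        return self

    def predict(self, X):
        # Distance from every sample to every class centroid
        dists = np.linalg.norm(X[:, None, :] - self.centroids_[None, :, :], axis=2)
        return self.classes_[dists.argmin(axis=1)]

def bagging_fit(X, y, n_models=25, seed=0):
    rng = np.random.default_rng(seed)
    n = len(X)
    models = []
    for _ in range(n_models):
        # Step 1: draw a bootstrap training set (sampling with replacement)
        idx = rng.integers(0, n, size=n)
        # Step 2: fit a new model on that training set
        models.append(NearestCentroid().fit(X[idx], y[idx]))
    return models

def bagging_predict(models, X):
    # Step 3: combine the models by majority voting
    votes = np.stack([m.predict(X) for m in models])  # (n_models, n_samples)
    return np.array([np.bincount(col).argmax() for col in votes.T])

# Synthetic data: two well-separated clusters, labeled 0 and 1
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(4, 1, (50, 2))])
y = np.array([0] * 50 + [1] * 50)

models = bagging_fit(X, y)
pred = bagging_predict(models, X)
accuracy = (pred == y).mean()
print(f"Training accuracy of the bagged ensemble: {accuracy:.2f}")
```

For a regression problem, step 3 would average the individual predictions instead of taking a vote.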

The following diagram illustrates the steps for bagging, using classification as an example:

We'll explore how to employ bagging mainly in Chapter 6, Predicting Online Ads Click-Through with Tree-Based Algorithms.