
Bagging

Bootstrap aggregating, or bagging, is an algorithm introduced by Leo Breiman in 1994 that applies bootstrapping to machine learning problems. Bootstrapping is a statistical procedure that creates new datasets from existing data by sampling with replacement. It can be used to estimate the range of values that the arithmetic mean, variance, or another quantity can assume.
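To make bootstrapping concrete, here is a minimal sketch that resamples a small dataset with replacement and looks at how the mean varies across resamples. The function name `bootstrap_means`, the toy data, and the choice of 1,000 resamples are assumptions made for illustration only; they do not come from the original text.

```python
import random
import statistics


def bootstrap_means(data, n_resamples=1000, seed=0):
    """Return the mean of each bootstrap resample of `data` (illustrative helper)."""
    rng = random.Random(seed)
    n = len(data)
    means = []
    for _ in range(n_resamples):
        # Draw n items WITH replacement, so some items repeat and others are left out.
        resample = rng.choices(data, k=n)
        means.append(statistics.fmean(resample))
    return means


# Toy data, invented for this example.
data = [2.1, 3.4, 1.9, 5.0, 4.2, 3.3, 2.8, 4.7]

means = sorted(bootstrap_means(data))

# A rough 95% percentile interval for the mean, read off the sorted resample means.
lower = means[int(0.025 * len(means))]
upper = means[int(0.975 * len(means)) - 1]

print(f"sample mean: {statistics.fmean(data):.3f}")
print(f"approx. 95% interval for the mean: [{lower:.3f}, {upper:.3f}]")
```

The same loop works for variance or any other statistic: swap `statistics.fmean` for the quantity of interest.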

The algorithm aims to reduce the chance of overfitting with the following steps:

  1. We generate new training sets from the input training data by sampling with replacement.
  2. For each generated training set, we fit a new model.
  3. We combine the results of the models by averaging (for regression) or majority voting (for classification).
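The three steps above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the base learner `fit_1nn` (a one-nearest-neighbor classifier on a single numeric feature), the helper names `bagging_fit` and `bagging_predict`, the toy data, and the choice of 25 models are all hypothetical and are not taken from the original text.

```python
import random
from collections import Counter


def fit_1nn(train):
    """Hypothetical base learner: 1-nearest-neighbor on one numeric feature.

    `train` is a list of (x, label) pairs; returns a predict function.
    """
    def predict(x):
        nearest = min(train, key=lambda pair: abs(pair[0] - x))
        return nearest[1]
    return predict


def bagging_fit(train, n_models=25, seed=0):
    """Fit `n_models` base learners, each on its own bootstrap sample."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        # Step 1: generate a new training set by sampling with replacement.
        sample = rng.choices(train, k=len(train))
        # Step 2: fit a new model on that training set.
        models.append(fit_1nn(sample))
    return models


def bagging_predict(models, x):
    """Step 3: combine the models' predictions by majority voting."""
    votes = Counter(model(x) for model in models)
    return votes.most_common(1)[0][0]


# Toy two-class data, invented for this example.
train = [(0.5, "a"), (1.0, "a"), (1.5, "a"), (2.0, "a"),
         (6.0, "b"), (6.5, "b"), (7.0, "b"), (7.5, "b")]

models = bagging_fit(train)
print(bagging_predict(models, 1.2), bagging_predict(models, 6.8))
```

For a regression problem, the voting step would be replaced by averaging the models' numeric predictions.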

The following diagram illustrates the steps for bagging, using classification as an example:

We'll explore how to apply bagging in more detail in Chapter 6, Predicting Online Ads Click-Through with Tree-Based Algorithms.
