- Mastering Predictive Analytics with scikit:learn and TensorFlow
- Alan Fontaine
- 215字
- 2021-07-23 16:42:24
Bootstrap sampling
Many ensemble learning methods use a statistical technique called bootstrap sampling. A bootstrap sample of a dataset is another dataset that's obtained by randomly sampling the observations from the original dataset with replacement.
This technique is heavily used in statistics, for example; it is used for estimating standard errors on sample statistics like mean or standard deviation of values.
Let's understand this technique more by taking a look at the following diagram:

Let's assume that we have a population of 1 to 10, which can be considered original population data. To get a bootstrap sample, we need to draw 10 samples from the original data with replacement. Imagine you have the 10 numbers written in 10 cards in a hat; for the first element of your sample, you take one card at random from the hat and write it down, then put the card back in the hat and this process goes on until you get 10 elements. This is your bootstrap sample. As you can see in the preceding example, 9 is repeated thrice in the bootstrap sample.
This resampling of numbers with replacement improves the accuracy of the true population data. It also helps in understanding various discrepancies and features involved in the resampling process, thereby increasing accuracy of the same.
- Microsoft Power BI Quick Start Guide
- PowerShell 3.0 Advanced Administration Handbook
- Visual C# 2008開發(fā)技術(shù)實(shí)例詳解
- Python Algorithmic Trading Cookbook
- 工業(yè)機(jī)器人現(xiàn)場編程(FANUC)
- 計(jì)算機(jī)網(wǎng)絡(luò)原理與技術(shù)
- 基于Xilinx ISE的FPAG/CPLD設(shè)計(jì)與應(yīng)用
- Deep Reinforcement Learning Hands-On
- 網(wǎng)站入侵與腳本攻防修煉
- Salesforce for Beginners
- 多媒體制作與應(yīng)用
- 從零開始學(xué)SQL Server
- 智能鼠原理與制作(進(jìn)階篇)
- HBase Essentials
- 網(wǎng)絡(luò)安全原理與應(yīng)用