- Hands-On Ensemble Learning with R
- Prabhanjan Narayanachar Tattar
- 346字
- 2021-07-23 19:10:55
The jackknife technique
Quenouille (1949) invented the jackknife technique. The purpose of this was to reduce bias by looking at multiple samples of data in a methodical way. The name jackknife seems to have been coined by the well-known statistician John W. Tukey. Due mainly to the lack of computational power, the advances and utility of the jackknife method were restricted. Efron invented the bootstrap method in 1979 (see the following section for its applications) and established the connection with the jackknife method. In fact, these two methods have a lot in common and are generally put under the umbrella of resampling methods.
Suppose that we draw a random sample of size n from a probability distribution F, and we denote by
the parameter of interest. Let
be an estimator of
, and here we don't have the probability distribution of
for a given
. Resampling methods will help in carrying out statistical inference when the probability distribution is unknown. A formal definition of the concept is in order.
Definition: Resampling methods are ways of estimating the bias and variance of the estimator that uses the values of
based on subsamples from the available observations
.
The jackknife technique is a resampling method, and we will lay down its general procedure in the ensuing discussion. As stated previously, is an estimator of
. For simplicity, we define the vector of the given observations by
. The important quantity in setting up this procedure is the pseudovalue, and we will define this mathematically next.
Definition: Let , that is,
is the vector
without the i-th observation. The i-th pseudovalue of
is then defined as follows:

It can be mathematically demonstrated that the pseudovalue is equivalent to the following:

Thus, the pseudovalue is seen as the bias-corrected version of . The pseudovalues defined here are also referred to as delete-one jackknife. The jackknife method treats the pseudovalues as independent observations with mean
, and then applies the central limit theorem for carrying out the statistical inference. The mean and (sampling) variance of the pseudovalues is given as follows:


- 樂高機器人:WeDo編程與搭建指南
- 玩轉(zhuǎn)智能機器人程小奔
- Design for the Future
- Canvas LMS Course Design
- Cloud Analytics with Microsoft Azure
- 城市道路交通主動控制技術(shù)
- Photoshop CS3特效處理融會貫通
- B2B2C網(wǎng)上商城開發(fā)指南
- 大學計算機應用基礎
- CompTIA Network+ Certification Guide
- 自動控制理論(非自動化專業(yè))
- ESP8266 Home Automation Projects
- Photoshop行業(yè)應用基礎
- Raspberry Pi Projects for Kids
- 菜鳥起飛五筆打字高手