- Statistics for Machine Learning
- Pratap Dangeti
- 304字
- 2021-07-02 19:05:55
Bias versus variance trade-off
Every model has both bias and variance error components in addition to white noise. Bias and variance are inversely related to each other; while trying to reduce one component, the other component of the model will increase. The true art lies in creating a good fit by balancing both. The ideal model will have both low bias and low variance.
Errors from the bias component come from erroneous assumptions in the underlying learning algorithm. High bias can cause an algorithm to miss the relevant relations between features and target outputs; this phenomenon causes an underfitting problem.
On the other hand, errors from the variance component come from sensitivity to change in the fit of the model, even a small change in training data; high variance can cause an overfitting problem:

An example of a high bias model is logistic or linear regression, in which the fit of the model is merely a straight line and may have a high error component due to the fact that a linear model could not approximate underlying data well.
An example of a high variance model is a decision tree, in which the model may create too much wiggly curve as a fit, in which even a small change in training data will cause a drastic change in the fit of the curve.
At the moment, state-of-the-art models are utilizing high variance models such as decision trees and performing ensemble on top of them to reduce the errors caused by high variance and at the same time not compromising on increases in errors due to the bias component. The best example of this category is random forest, in which many decision trees will be grown independently and ensemble in order to come up with the best fit; we will cover this in upcoming chapters:

- ClickHouse性能之巔:從架構(gòu)設(shè)計解讀性能之謎
- Microsoft Exchange Server PowerShell Cookbook(Third Edition)
- Git Version Control Cookbook
- C和C++安全編碼(原書第2版)
- 數(shù)據(jù)庫系統(tǒng)原理及MySQL應(yīng)用教程
- VSTO開發(fā)入門教程
- 程序員修煉之道:通向務(wù)實的最高境界(第2版)
- Python編程實戰(zhàn)
- Swift語言實戰(zhàn)精講
- Mastering JavaScript High Performance
- Asynchronous Android Programming(Second Edition)
- 西門子S7-200 SMART PLC編程從入門到實踐
- GameMaker Essentials
- Citrix XenServer企業(yè)運維實戰(zhàn)
- Bootstrap for Rails