- Statistics for Data Science
- James D. Miller
- 342字
- 2021-07-02 14:58:54
Regularization
Regularization is one possible approach that a data scientist may use for improving the results generated from a statistical model or data science process, such as when addressing a case of overfitting in statistics and data science.
Overfitting usually occurs with an overly simple model. This means that you may have only two variables and are drawing conclusions based on the two. For example, using our previously mentioned example of daffodil sales, one might generate a model with temperature as an independent variable and sales as a dependent one. You may see the model fail since it is not as simple as concluding that warmer temperatures will always generate more sales.
In this example, there is a tendency to add more data to the process or model in hopes of achieving a better result. The idea sounds reasonable. For example, you have information such as average rainfall, pollen count, fertilizer sales, and so on; could these data points be added as explanatory variables?
Continuing to add more and more data to your model will have an effect but will probably cause overfitting, resulting in poor predictions since it will closely resemble the data, which is mostly just background noise.
To overcome this situation, a data scientist can use regularization, introducing a tuning parameter (additional factors such as a data points mean value or a minimum or maximum limitation, which gives you the ability to change the complexity or smoothness of your model) into the data science process to solve an ill-posed problem or to prevent overfitting.
- Word 2000、Excel 2000、PowerPoint 2000上機指導與練習
- 樂高機器人EV3設計指南:創(chuàng)造者的搭建邏輯
- Natural Language Processing Fundamentals
- 商戰(zhàn)數(shù)據(jù)挖掘:你需要了解的數(shù)據(jù)科學與分析思維
- Dreamweaver CS3網(wǎng)頁設計與網(wǎng)站建設詳解
- 21天學通C#
- OpenStack Cloud Computing Cookbook(Second Edition)
- Windows游戲程序設計基礎
- CompTIA Linux+ Certification Guide
- Splunk Operational Intelligence Cookbook
- 愛犯錯的智能體
- 新編計算機組裝與維修
- 水下無線傳感器網(wǎng)絡的通信與決策技術
- TensorFlow Reinforcement Learning Quick Start Guide
- LMMS:A Complete Guide to Dance Music Production Beginner's Guide