- Effective Amazon Machine Learning
- Alexis Perrier
- 208字
- 2021-07-03 00:17:52
Underfitting
Underfitting means that the model was poorly trained. Either the training dataset did not have enough information to infer strong predictions, or the algorithm that trained the model on the training dataset was not adequate for the context. The algorithm was not well parameterized or simply inadequate for the data.
If we measure the prediction error not only on the validation set but also on the training set, the prediction error will be large if the model is underfitting. Which makes sense: if the model cannot predict the training, it won't be able to predict the outcomes in the validation set it has not seen before. Underfitting basically means your model is not working.
Common strategies to palliate this problem include:
- Getting more data samples – If the problem comes from a dataset that is too small or does not contain sufficient information, getting more data may improve the model performance.
- Adding more features, raw or via feature engineering – by taking the log, squaring, binning, using splines or power functions. Adding many features and seeing how that improves the predictions.
- Choosing another model – Support Vector Machine, Random Forest, Boosted trees, Bayes classifiers all have different strengths in different contexts.
推薦閱讀
- Python絕技:運用Python成為頂級數據工程師
- 云數據中心基礎
- Unity 5.x Game AI Programming Cookbook
- 分布式數據庫系統:大數據時代新型數據庫技術(第3版)
- 業務數據分析:五招破解業務難題
- 商業分析思維與實踐:用數據分析解決商業問題
- Access 2016數據庫技術及應用
- Python數據分析:基于Plotly的動態可視化繪圖
- 數字媒體交互設計(初級):Web產品交互設計方法與案例
- 云原生數據中臺:架構、方法論與實踐
- Unreal Engine Virtual Reality Quick Start Guide
- 智慧城市中的大數據分析技術
- Internet of Things with Python
- 中國云存儲發展報告
- 大數據技術體系詳解:原理、架構與實踐