- Effective Amazon Machine Learning
- Alexis Perrier
- 208字
- 2021-07-03 00:17:52
Underfitting
Underfitting means that the model was poorly trained. Either the training dataset did not have enough information to infer strong predictions, or the algorithm that trained the model on the training dataset was not adequate for the context. The algorithm was not well parameterized or simply inadequate for the data.
If we measure the prediction error not only on the validation set but also on the training set, the prediction error will be large if the model is underfitting. Which makes sense: if the model cannot predict the training, it won't be able to predict the outcomes in the validation set it has not seen before. Underfitting basically means your model is not working.
Common strategies to palliate this problem include:
- Getting more data samples – If the problem comes from a dataset that is too small or does not contain sufficient information, getting more data may improve the model performance.
- Adding more features, raw or via feature engineering – by taking the log, squaring, binning, using splines or power functions. Adding many features and seeing how that improves the predictions.
- Choosing another model – Support Vector Machine, Random Forest, Boosted trees, Bayes classifiers all have different strengths in different contexts.
推薦閱讀
- Unity 5.x Game AI Programming Cookbook
- Visual Studio 2015 Cookbook(Second Edition)
- 文本數(shù)據(jù)挖掘:基于R語言
- 大數(shù)據(jù):規(guī)劃、實(shí)施、運(yùn)維
- 數(shù)據(jù)驅(qū)動(dòng):從方法到實(shí)踐
- Power BI商業(yè)數(shù)據(jù)分析完全自學(xué)教程
- Hands-On Mathematics for Deep Learning
- LabVIEW 完全自學(xué)手冊
- Proxmox VE超融合集群實(shí)踐真?zhèn)?/a>
- Hadoop集群與安全
- Oracle 內(nèi)核技術(shù)揭密
- 大數(shù)據(jù)計(jì)算系統(tǒng)原理、技術(shù)與應(yīng)用
- 精通Neo4j
- Configuration Management with Chef-Solo
- 數(shù)據(jù)庫原理及應(yīng)用實(shí)驗(yàn):基于GaussDB的實(shí)現(xiàn)方法