官术网_书友最值得收藏!

Underfitting

Underfitting means that the model was poorly trained. Either the training dataset did not have enough information to infer strong predictions, or the algorithm that trained the model on the training dataset was not adequate for the context. The algorithm was not well parameterized or simply inadequate for the data.

If we measure the prediction error not only on the validation set but also on the training set, the prediction error will be large if the model is underfitting. Which makes sense: if the model cannot predict the training, it won't be able to predict the outcomes in the validation set it has not seen before. Underfitting basically means your model is not working.

Common strategies to palliate this problem include:

  • Getting more data samples – If the problem comes from a dataset that is too small or does not contain sufficient information, getting more data may improve the model performance.
  • Adding more features, raw or via feature engineering – by taking the log, squaring, binning, using splines or power functions. Adding many features and seeing how that improves the predictions.
  • Choosing another model – Support Vector Machine, Random Forest, Boosted trees, Bayes classifiers all have different strengths in different contexts.
主站蜘蛛池模板: 喀喇沁旗| 丁青县| 内江市| 连城县| 郧西县| 湛江市| 德庆县| 南宁市| 禹州市| 济宁市| 常德市| 六枝特区| 尼勒克县| 车险| 河北省| 通渭县| 葫芦岛市| 且末县| 平南县| 津南区| 凤翔县| 修武县| 多伦县| 桦甸市| 锡林浩特市| 大新县| 龙门县| 永平县| 博兴县| 密山市| 康马县| 德格县| 武平县| 汾西县| 博乐市| 天气| 和顺县| 浦县| 岱山县| 阳东县| 同仁县|