Underfitting

Underfitting means that the model has failed to capture the structure in the training data. Either the training dataset does not contain enough information to support strong predictions, or the algorithm used to train the model is not adequate for the context: it is poorly parameterized or simply unsuited to the data.

If we measure the prediction error on the training set as well as on the validation set, both errors will be large when the model is underfitting. This makes sense: if the model cannot predict the training data, it will not be able to predict the outcomes in a validation set it has not seen before. In short, an underfitted model does not work, even on the data it was trained on.
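
The check described above can be sketched in a few lines of plain Python. This is a minimal illustration, not a recipe: the quadratic toy data, the alternating train/validation split, and the helper names `fit_line` and `mse` are all assumptions made for the example. A straight line is fitted to data that follows a curve, and the error is measured on both sets, along with the error of a baseline that always predicts the training mean.

```python
# Illustrative sketch: a straight line fitted to quadratic data underfits.
# The data, the split, and the helper names are assumptions for this demo.

def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

def mse(xs, ys, intercept, slope):
    """Mean squared error of the line on the given points."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Noise-free quadratic data on [-3, 3].
xs = [i / 10 for i in range(-30, 31)]
ys = [x ** 2 for x in xs]

# Alternate points between the training and validation sets.
train_x, train_y = xs[0::2], ys[0::2]
val_x, val_y = xs[1::2], ys[1::2]

intercept, slope = fit_line(train_x, train_y)
train_mse = mse(train_x, train_y, intercept, slope)
val_mse = mse(val_x, val_y, intercept, slope)

# Baseline: always predict the mean of the training targets.
mean_train_y = sum(train_y) / len(train_y)
baseline_mse = mse(train_x, train_y, mean_train_y, 0.0)

print(f"training MSE:   {train_mse:.2f}")
print(f"validation MSE: {val_mse:.2f}")
print(f"baseline MSE:   {baseline_mse:.2f}")
```

In this setup the training error is about as large as the baseline error, and the validation error is of the same order. That pattern, high error on both sets, is the signature of underfitting.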

Common strategies to mitigate this problem include:

  • Getting more data samples – If the problem comes from a dataset that is too small or does not contain sufficient information, getting more data may improve the model's performance.
  • Adding more features, raw or via feature engineering – For example, by taking the log, squaring, binning, or using splines or power functions. Try adding features and measure whether the predictions improve.
  • Choosing another model – Support Vector Machines, Random Forests, boosted trees, and Bayes classifiers all have different strengths in different contexts.
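
As a small illustration of the feature engineering strategy, the sketch below fits the same one-variable linear model twice: once on a raw feature and once on its square. The data-generating rule `y = 2 + 3 * x**2` and the helper name `fit_and_score` are assumptions chosen for the example, and the error is measured on the training data only, which is enough to show whether the model underfits.

```python
# Illustrative sketch: an engineered feature (x squared) removes the
# underfitting seen with the raw feature. Data and names are assumptions.

def fit_and_score(features, targets):
    """Fit y = intercept + slope * feature by least squares; return training MSE."""
    n = len(features)
    mean_f = sum(features) / n
    mean_t = sum(targets) / n
    sff = sum((f - mean_f) ** 2 for f in features)
    sft = sum((f - mean_f) * (t - mean_t) for f, t in zip(features, targets))
    slope = sft / sff
    intercept = mean_t - slope * mean_f
    errors = [(t - (intercept + slope * f)) ** 2 for f, t in zip(features, targets)]
    return sum(errors) / n

# Toy data: the target depends on the square of x, not on x itself.
xs = [i / 10 for i in range(-30, 31)]
ys = [2 + 3 * x ** 2 for x in xs]

# Model 1: the raw feature x.
raw_mse = fit_and_score(xs, ys)

# Model 2: the engineered feature x squared.
squared = [x ** 2 for x in xs]
engineered_mse = fit_and_score(squared, ys)

print(f"training MSE with raw x:     {raw_mse:.4f}")
print(f"training MSE with x squared: {engineered_mse:.4f}")
```

The algorithm is unchanged between the two fits; only the feature differs. On real data the right transformation is rarely known in advance, so each candidate feature should be judged by its effect on the validation error rather than assumed to help.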