Underfitting

Underfitting means that the model has not captured the structure in the training data. Either the training dataset does not contain enough information to support strong predictions, or the algorithm used to train the model is not suited to the problem: it was poorly parameterized or is simply inadequate for the data.

If we measure the prediction error on the training set as well as on the validation set, both errors will be large when the model is underfitting. This makes sense: if the model cannot predict the training data, it will not be able to predict outcomes in a validation set it has never seen. In short, an underfitting model is not working.
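The check described above can be sketched in a few lines. This is only an illustration under stated assumptions: the data is synthetic (a quadratic signal with a little noise), the model is a deliberately too-simple straight line fitted with NumPy, and names such as `mse`, `train_error`, and `val_error` are hypothetical and do not come from the text.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (an assumption for illustration): quadratic signal plus small noise.
x = rng.uniform(-3, 3, size=200)
y = x**2 + rng.normal(scale=0.1, size=200)

# Hold out the last 50 points as a validation set.
x_train, y_train = x[:150], y[:150]
x_val, y_val = x[150:], y[150:]


def mse(y_true, y_pred):
    """Mean squared prediction error."""
    return float(np.mean((y_true - y_pred) ** 2))


# Fit a straight line, which is too simple for a quadratic relationship.
# np.polyfit returns coefficients from the highest degree down.
slope, intercept = np.polyfit(x_train, y_train, deg=1)

train_error = mse(y_train, slope * x_train + intercept)
val_error = mse(y_val, slope * x_val + intercept)

# The noise variance (0.1 squared) is the best error any model could reach here.
print(f"training error:   {train_error:.2f}")
print(f"validation error: {val_error:.2f}")
print(f"noise variance:   {0.1 ** 2:.2f}")
```

Both errors should come out far above the noise variance. A large error on the training set itself, not only on the validation set, is the signature of underfitting.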

Common strategies to mitigate this problem include:

  • Getting more data samples – If the problem comes from a dataset that is too small or does not contain sufficient information, getting more data may improve the model performance.
  • Adding more features, raw or engineered – New features can be derived from existing ones by taking the log, squaring, binning, or applying splines or power functions. Add candidate features and check whether the predictions improve.
  • Choosing another model – Support Vector Machines, Random Forests, boosted trees, and Bayes classifiers all have different strengths in different contexts.
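As a rough illustration of the feature-engineering strategy, the sketch below fits the same linear model twice on synthetic quadratic data: once with the raw feature only, and once with a squared feature added. The data, the helper `fit_and_score`, and the use of NumPy least squares are all assumptions made for this example, not a method taken from the text.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (an assumption for illustration): quadratic signal plus small noise.
x = rng.uniform(-3, 3, size=200)
y = x**2 + rng.normal(scale=0.1, size=200)

# First 150 rows for training, last 50 for validation.
train, val = slice(0, 150), slice(150, 200)


def fit_and_score(features):
    """Fit ordinary least squares on the training rows; return (train MSE, validation MSE)."""
    # Design matrix: an intercept column followed by the given feature columns.
    X = np.column_stack([np.ones(len(x))] + features)
    coef, *_ = np.linalg.lstsq(X[train], y[train], rcond=None)

    def mse(rows):
        return float(np.mean((y[rows] - X[rows] @ coef) ** 2))

    return mse(train), mse(val)


# Raw feature only: the linear model underfits.
raw_train, raw_val = fit_and_score([x])

# Same model with an engineered squared feature added.
eng_train, eng_val = fit_and_score([x, x**2])

print(f"raw feature:      train={raw_train:.3f}  val={raw_val:.3f}")
print(f"with x^2 feature: train={eng_train:.3f}  val={eng_val:.3f}")
```

The learning algorithm is unchanged between the two fits; only the inputs differ. When the added feature matches the shape of the underlying relationship, both the training and validation errors should drop sharply.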