Training different regression models

The following screenshot shows the dataframe in which we are going to save each model's performance. We are going to run four models: logistic regression, bagging, random forest, and boosting:

We are going to use the following evaluation metrics in this case:

  • accuracy: This metric measures how often the model predicts defaulters and non-defaulters correctly
  • precision: This metric measures how often the model is correct when it predicts a default
  • recall: This metric measures the proportion of actual defaulters that the model correctly predicts
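
As a minimal sketch of how these three metrics are computed, the following plain-Python function counts true and false positives and negatives for a binary target. The function name `classification_metrics`, the label encoding (1 = defaulter, 0 = non-defaulter), and the toy labels are assumptions made for illustration; they are not taken from the book's dataset or code.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, and recall for binary labels.

    Hypothetical helper for illustration; `positive` marks the defaulter class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # defaulters caught
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false alarms
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # defaulters missed
    tn = len(pairs) - tp - fp - fn                                    # non-defaulters cleared
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Toy labels, made up for illustration: 1 = defaulter, 0 = non-defaulter
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 0, 1, 0]

metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

In this toy case the model never raises a false alarm, so precision is perfect, but it misses one of the four actual defaulters, which is what recall penalizes.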

The most important of these is recall, because we want to maximize the proportion of actual defaulters that the model identifies. The model with the best recall is therefore selected.
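
The selection rule can be sketched in a few lines, assuming the saved performance is available as a mapping from model name to its metrics. The helper name `best_by_recall` and every number below are hypothetical placeholders chosen only to make the example run; they are not results from the book.

```python
def best_by_recall(performance):
    """Return the name of the model with the highest recall (hypothetical helper)."""
    return max(performance, key=lambda name: performance[name]["recall"])


# Placeholder values for illustration only, NOT real results
performance = {
    "logistic regression": {"accuracy": 0.80, "precision": 0.70, "recall": 0.30},
    "bagging":             {"accuracy": 0.81, "precision": 0.65, "recall": 0.35},
    "random forest":       {"accuracy": 0.82, "precision": 0.68, "recall": 0.38},
    "boosting":            {"accuracy": 0.79, "precision": 0.60, "recall": 0.45},
}

selected = best_by_recall(performance)
print(selected)
```

Note that in this made-up table the model with the highest recall has the lowest accuracy; ranking by recall accepts that trade-off in order to catch more actual defaulters.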
