官术网_书友最值得收藏!

Training different regression models

The following screenshot shows a dataframe where we are going to save performance. We are going to run four models, namely logistic regression, bagging, random forest, and boosting:

We are going to use the following evaluation metrics in this case:

  • accuracy: This metric measures how often the model predicts defaulters and non-defaulters correctly
  • precision: This metric will be when the model predicts the default and how often the model is correct
  • recall: This metric will be the proportion of actual defaulters that the model will correctly predict

The most important of these is the recall metric. The reason behind this is that we want to maximize the proportion of actual defaulters that the model identifies, and so the model with the best recall is selected.

主站蜘蛛池模板: 肥乡县| 阳高县| 夏邑县| 淅川县| 澄迈县| 龙陵县| 民勤县| 永德县| 万载县| 呼伦贝尔市| 长泰县| 上饶市| 宝应县| 青海省| 金沙县| 吉林市| 威信县| 丰原市| 南雄市| 阜城县| 深圳市| 日喀则市| 原平市| 宜丰县| 海口市| 刚察县| 库尔勒市| 延津县| 青浦区| 融水| 邛崃市| 高青县| 化德县| 雅江县| 高安市| 庐江县| 凉山| 威海市| 惠水县| 灵台县| 从江县|