官术网_书友最值得收藏!

Summary

In this chapter, we looked at using probabilistic linear models to predict a qualitative response with two generalized linear model methods: logistic regression, and multivariate adaptive regression splines. We explored using the weight of information and information value as a technique to do univariate feature selection. We covered the concept of finding the proper probability threshold to minimize classification error. Additionally, we began the process of using various performance metrics such as AUC, log-loss, and ROC charts to explore model selection visually and statistically. These metrics proved to be more informative than just pure accuracy, especially in a situation where class labels are highly imbalanced. In the next chapter, we'll cover regularization methods for feature selection, and how it can be used in training your algorithms. We'll see how we can create a dataset. We'll know about ridge regression and dive deeper in feature selection.

主站蜘蛛池模板: 湖北省| 车险| 武夷山市| 青浦区| 南康市| 霞浦县| 买车| 汪清县| 竹北市| 铁岭市| 长阳| 洛隆县| 武冈市| 怀仁县| 古浪县| 周口市| 隆德县| 东至县| 泸水县| 江西省| 兴安县| 深州市| 新邵县| 桓台县| 皋兰县| 宿迁市| 大新县| 定边县| 大庆市| 延川县| 通榆县| 通城县| 新源县| 灵璧县| 应用必备| 明水县| 沧州市| 禹城市| 江源县| 赤城县| 囊谦县|