- Mastering Machine Learning with R
- Cory Lesmeister
- 146字
- 2021-07-02 13:46:24
Summary
In this chapter, we looked at using probabilistic linear models to predict a qualitative response with two generalized linear model methods: logistic regression, and multivariate adaptive regression splines. We explored using the weight of information and information value as a technique to do univariate feature selection. We covered the concept of finding the proper probability threshold to minimize classification error. Additionally, we began the process of using various performance metrics such as AUC, log-loss, and ROC charts to explore model selection visually and statistically. These metrics proved to be more informative than just pure accuracy, especially in a situation where class labels are highly imbalanced. In the next chapter, we'll cover regularization methods for feature selection, and how it can be used in training your algorithms. We'll see how we can create a dataset. We'll know about ridge regression and dive deeper in feature selection.
- 工業(yè)機(jī)器人產(chǎn)品應(yīng)用實(shí)戰(zhàn)
- 大數(shù)據(jù)專業(yè)英語
- 并行數(shù)據(jù)挖掘及性能優(yōu)化:關(guān)聯(lián)規(guī)則與數(shù)據(jù)相關(guān)性分析
- 來吧!帶你玩轉(zhuǎn)Excel VBA
- 大型數(shù)據(jù)庫管理系統(tǒng)技術(shù)、應(yīng)用與實(shí)例分析:SQL Server 2005
- Prometheus監(jiān)控實(shí)戰(zhàn)
- 水下無線傳感器網(wǎng)絡(luò)的通信與決策技術(shù)
- TensorFlow Reinforcement Learning Quick Start Guide
- 分析力!專業(yè)Excel的制作與分析實(shí)用法則
- Visual Studio 2010 (C#) Windows數(shù)據(jù)庫項(xiàng)目開發(fā)
- Excel 2007終極技巧金典
- MATLAB-Simulink系統(tǒng)仿真超級(jí)學(xué)習(xí)手冊(cè)
- 寒江獨(dú)釣:Windows內(nèi)核安全編程
- 中老年人學(xué)電腦與上網(wǎng)
- Hands-On Geospatial Analysis with R and QGIS