官术网_书友最值得收藏!

Logistic Regression

"The true logic of this world is the calculus of probabilities."
- James Clerk Maxwell, Scottish physicist

In the previous chapter, we took a look at using Ordinary Least Squares (OLS) to predict a quantitative outcome or, in other words, linear regression. It's now time to shift gears somewhat and examine how we can develop algorithms to predict qualitative outcomes. Such outcome variables could be binary (male versus female, purchase versus doesn't purchase, or a tumor is benign versus malignant) or multinomial categories (education level or eye color). Regardless of whether the outcome of interest is binary or multinomial, our task is to predict the probability of an observation belonging to a particular category of the outcome variable. In other words, we develop an algorithm to classify the observations.

To begin exploring classification problems, we'll discuss why applying the OLS linear regression isn't the correct technique and how the algorithms introduced in this chapter can solve these issues. We'll then look at the problem of predicting whether or not a banking customer is satisfied. To tackle this problem, we'll begin by building and interpreting a logistic regression model. We'll also start examining a univariate method to select features. Next, we'll turn to multivariate regression splines and discover ways to choose the best overall algorithm. This chapter will set the stage for more advanced machine learning methods in subsequent chapters.

We'll be covering the following topics in this chapter:

  • Classification methods and linear regression
  • Logistic regression
  • Model training and evaluation
主站蜘蛛池模板: 类乌齐县| 朔州市| 长丰县| 宜川县| 高邮市| 新龙县| 巩留县| 黑河市| 金溪县| 浦北县| 丘北县| 阿鲁科尔沁旗| 托克逊县| 汤原县| 樟树市| 桐乡市| 泰和县| 诸暨市| 贵州省| 秦安县| 大洼县| 孝昌县| 伊川县| 阜阳市| 巢湖市| 青田县| 福清市| 曲阳县| 沁水县| 清涧县| 兴宁市| 东明县| 虹口区| 海盐县| 福海县| 夏津县| 恭城| 洛隆县| 廉江市| 广德县| 陇川县|