
Classification methods and linear regression

So, why can't we use the least squares regression method that we learned in the previous chapter for a qualitative outcome? Well, as it turns out, you can, but at your own risk. Let's assume for a second that the outcome you're trying to predict has three classes: mild, moderate, and severe. You and your colleagues also assume that the step from mild to moderate is the same size as the step from moderate to severe, and that the relationship is linear. You can then code the outcome numerically, with 0 for mild, 1 for moderate, and 2 for severe. If you have reason to believe these assumptions hold, then linear regression might be an acceptable solution. However, qualitative labels such as these are prone to a high level of measurement error, which can bias the OLS estimates. In most business problems, there's no scientifically acceptable way to convert a qualitative response into a quantitative one.

What if you have a response with only two outcomes, say fail and pass? Using the dummy variable approach, we could code the fail outcome as 0 and the pass outcome as 1. Using linear regression, we could then build a model where the predicted value is read as the probability that an observation is a pass. However, the estimates of Y from such a model can easily fall outside the probability bounds of [0,1], that is, below 0 or above 1, which makes them difficult to interpret.
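To make the last point concrete, here is a minimal sketch in plain Python of fitting a least squares line to a 0/1 outcome. The data (hours studied against a fail/pass result) and the names `fit_ols` and `predict` are made up for illustration and do not come from the chapter; the fit is the usual closed-form solution for simple linear regression.

```python
# Hypothetical data, invented for illustration only:
# hours studied and exam outcome coded as a dummy variable (0 = fail, 1 = pass).
hours = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
passed = [0, 0, 0, 0, 1, 0, 1, 1, 1, 1]


def fit_ols(x, y):
    """Return (intercept, slope) of the simple least squares line y = a + b*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Slope = covariance of x and y divided by the variance of x.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


intercept, slope = fit_ols(hours, passed)


def predict(x):
    """Fitted value, read here as the 'probability' of a pass."""
    return intercept + slope * x


# Print fitted values for a few inputs, flagging any that are not valid probabilities.
for h in (0, 1, 5, 10, 12):
    p = predict(h)
    flag = "" if 0.0 <= p <= 1.0 else "  <- outside [0, 1]"
    print(f"hours={h:>2}  predicted={p:.3f}{flag}")
```

With this particular toy dataset, the fitted line dips below 0 for small inputs and rises above 1 for large ones, so some predictions cannot be read as probabilities at all.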
