Classification methods and linear regression

So, why can't we just use the least squares regression method that we learned in the previous chapter for a qualitative outcome? Well, as it turns out, you can, but at your own risk. Let's assume for a second that you have an outcome that you are trying to predict and it has three different classes: mild, moderate, and severe. You and your colleagues also assume that the gap between mild and moderate is the same as the gap between moderate and severe, and that the relationship is linear. You can then code the response numerically, where 0 is equal to mild, 1 is equal to moderate, and 2 is equal to severe. If you have reason to believe these assumptions, then linear regression might be an acceptable solution. However, qualitative assessments such as these are prone to a high level of measurement error, which can bias the OLS estimates. In most business problems, there is no scientifically acceptable way to convert a qualitative response to one that is quantitative.

What if you have a response with two outcomes, say fail and pass? Again, using the dummy variable approach, we could code the fail outcome as 0 and the pass outcome as 1. Using linear regression, we could build a model where the predicted value is read as the probability that an observation is a pass. However, nothing constrains the fitted line to the [0,1] interval, so the estimates of Y will most likely fall below 0 or above 1 for some observations, which makes them difficult to interpret as probabilities.
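To make the pass/fail problem concrete, here is a minimal sketch in Python with NumPy. The data are made up for illustration (a hypothetical `hours` studied predictor and a 0/1 `passed` response), and the `predict` helper is our own name, not part of any library. It fits an ordinary least squares line to the 0/1 response and checks whether the fitted values stay inside [0,1].

```python
import numpy as np

# Hypothetical data (not from the text): hours studied and a 0/1 outcome,
# where 0 = fail and 1 = pass.
hours = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9, 10], dtype=float)
passed = np.array([0, 0, 0, 0, 1, 0, 1, 1, 1, 1], dtype=float)

# Design matrix with an intercept column, then an ordinary least squares fit.
X = np.column_stack([np.ones_like(hours), hours])
coef, *_ = np.linalg.lstsq(X, passed, rcond=None)
intercept, slope = coef


def predict(h):
    """Return the fitted value of the linear model for hours h."""
    return intercept + slope * np.asarray(h, dtype=float)


fitted = predict(hours)

# Flag any fitted "probabilities" that fall outside the [0, 1] interval.
out_of_range = (fitted < 0) | (fitted > 1)

for h, p, bad in zip(hours, fitted, out_of_range):
    flag = "  <-- outside [0, 1]" if bad else ""
    print(f"hours={h:4.1f}  fitted={p:6.3f}{flag}")
```

With this toy data, the fitted values at the low and high ends of `hours` land below 0 and above 1 respectively, even though every observed response is exactly 0 or 1. That is the interpretability problem described above: the straight line has no way to respect the bounds of a probability.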
