- Hands-On Ensemble Learning with R
- Prabhanjan Narayanachar Tattar
- 158字
- 2021-07-23 19:10:50
German Credit
Loans are not always repaid in full, and there are defaulters. In this case, it becomes important for the bank to identify potential defaulters based on the available information. Here, we adapt the GC
dataset from the RSADBE
package to properly reflect the labels of the factor variable. The transformed dataset is available as GC2.RData
in the data folder. The GC
dataset itself is mainly an adaptation of the version available at https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data). Here, we have 1,000 observations, and 20 covariate/independent variables such as the status of existing checking account, duration, and so forth. The final status of whether the loan was completely paid or not is available in the good_bad
column. We will partition the data into training and testing parts, and create the formula too:
> library(RSADBE) > load("../Data/GC2.RData") > table(GC2$good_bad) bad good 300 700 > set.seed(12345) > Train_Test <- sample(c("Train","Test"),nrow(GC2),replace = TRUE,prob=c(0.7,0.3)) > head(Train_Test) [1] "Test" "Test" "Test" "Test" "Train" "Train" > GC2_Train <- GC2[Train_Test=="Train",] > GC2_TestX <- within(GC2[Train_Test=="Test",],rm(good_bad)) > GC2_TestY <- GC2[Train_Test=="Test","good_bad"] > GC2_Formula <- as.formula("good_bad~.")
- 大數(shù)據(jù)技術(shù)與應(yīng)用基礎(chǔ)
- STM32G4入門與電機(jī)控制實(shí)戰(zhàn):基于X-CUBE-MCSDK的無(wú)刷直流電機(jī)與永磁同步電機(jī)控制實(shí)現(xiàn)
- 自動(dòng)化控制工程設(shè)計(jì)
- 信息物理系統(tǒng)(CPS)測(cè)試與評(píng)價(jià)技術(shù)
- MPC5554/5553微處理器揭秘
- C#求職寶典
- 智能制造系統(tǒng)及關(guān)鍵使能技術(shù)
- 機(jī)器人剛?cè)狁詈蟿?dòng)力學(xué)
- Advanced Deep Learning with Keras
- DynamoDB Applied Design Patterns
- Hands-On Geospatial Analysis with R and QGIS
- Practical Network Automation
- Flash CS3動(dòng)畫制作融會(huì)貫通
- ASP.NET學(xué)習(xí)手冊(cè)
- ARM? Cortex? M4 Cookbook