- Hands-On Ensemble Learning with R
- Prabhanjan Narayanachar Tattar
- 2021-07-23 19:10:50
Pima Indians Diabetes
Diabetes is a largely incurable health condition, and patients diagnosed with it have to adjust their lifestyles to manage it. Based on variables such as pregnant, glucose, pressure, triceps, insulin, mass, pedigree, and age, the problem here is to classify a person as diabetic or not. The dataset has 768 observations and is drawn from the mlbench package:
> library(mlbench)
> data("PimaIndiansDiabetes")
> set.seed(12345)
> Train_Test <- sample(c("Train","Test"),nrow(PimaIndiansDiabetes),replace = TRUE,
+   prob = c(0.7,0.3))
> head(Train_Test)
[1] "Test"  "Test"  "Test"  "Test"  "Train" "Train"
> PimaIndiansDiabetes_Train <- PimaIndiansDiabetes[Train_Test=="Train",]
> PimaIndiansDiabetes_TestX <- within(PimaIndiansDiabetes[Train_Test=="Test",],
+   rm(diabetes))
> PimaIndiansDiabetes_TestY <- PimaIndiansDiabetes[Train_Test=="Test","diabetes"]
> PID_Formula <- as.formula("diabetes~.")
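Because the partition above labels each row independently with probabilities 0.7 and 0.3, the training and test sets only approximate a 70/30 split. The following minimal sketch checks the realised sizes using base R alone, taking the row count of 768 from the text instead of loading the data. The names n_obs, split_counts, train_idx, and test_idx are illustrative and do not appear in the book, and the fixed-size variant at the end is a hypothetical alternative, not the author's method.

```r
# Row count taken from the text (768 observations); no package is needed here.
n_obs <- 768

# Same labelling scheme as in the section: each row is independently
# assigned "Train" with probability 0.7 and "Test" with probability 0.3.
set.seed(12345)
Train_Test <- sample(c("Train", "Test"), n_obs, replace = TRUE,
                     prob = c(0.7, 0.3))

# Realised partition sizes; they are random, so they only approximate 70/30.
split_counts <- table(Train_Test)
print(split_counts)
print(round(prop.table(split_counts), 3))

# Hypothetical alternative (not the book's approach): draw exactly
# floor(0.7 * n_obs) row indices without replacement, which fixes the
# training-set size in advance.
train_idx <- sample(seq_len(n_obs), size = floor(0.7 * n_obs))
test_idx  <- setdiff(seq_len(n_obs), train_idx)
```

The probabilistic labelling is convenient because the same Train_Test vector can be reused to index several objects. The index-based variant is an option when exact partition sizes matter.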
The five datasets described up to this point are all classification problems. Next, we look at one example each for regression, time series, survival, clustering, and outlier detection problems.