- Mastering Machine Learning with R
- Cory Lesmeister
- 2021-07-09 21:28:18
Data preparation
Almost there! This step has the following five tasks:
- Select the data
- Clean the data
- Construct the data
- Integrate the data
- Format the data
These tasks are relatively self-explanatory. The goal is to get the data ready to feed into the algorithms, which includes merging, feature engineering, and transformations. If imputation is needed, it happens here as well. Additionally, with R, pay attention to how the outcome needs to be labeled. If your outcome/response variable is Yes/No, it may not work in some packages and will require a transformed or new variable coded as 1/0. At this point, you should also split your data into the various subsets, if applicable: train, test, or validate. This step can be a heavy burden, but most experienced people will tell you that it is where you can separate yourself from your peers. With this, let's move on to the money step.
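As a rough illustration of the tasks described in this section, the following base R sketch recodes a Yes/No outcome as 1/0, splits the data into train, validation, and test subsets, and imputes missing values. The data frame `df`, its columns (`age`, `income`, `churn`), the 60/20/20 split, and median imputation are all assumptions made for this example; they do not come from the text. Computing the median on the training subset only is one common way to keep information from the held-out subsets out of the training data, not the only valid approach.

```r
# Hypothetical toy data; the column names are illustrative only.
set.seed(123)
df <- data.frame(
  age    = c(23, 35, NA, 41, 29, 52, 38, NA, 47, 31),
  income = c(40, 52, 61, 58, 45, 80, 66, 49, 73, 55),
  churn  = c("Yes", "No", "No", "Yes", "No", "Yes", "No", "No", "Yes", "No"),
  stringsAsFactors = FALSE
)

# Recode the Yes/No outcome as a new 1/0 variable,
# since some packages will not accept character labels.
df$churn_num <- ifelse(df$churn == "Yes", 1, 0)

# Shuffle the row indices and split 60/20/20 (assumed proportions).
n       <- nrow(df)
idx     <- sample(seq_len(n))
n_train <- floor(0.6 * n)
n_valid <- floor(0.2 * n)

train <- df[idx[1:n_train], ]
valid <- df[idx[(n_train + 1):(n_train + n_valid)], ]
test  <- df[idx[(n_train + n_valid + 1):n], ]

# Impute missing ages with the median computed from the training rows only,
# then apply that same value to the validation and test subsets.
age_median <- median(train$age, na.rm = TRUE)
train$age[is.na(train$age)] <- age_median
valid$age[is.na(valid$age)] <- age_median
test$age[is.na(test$age)]   <- age_median
```

In practice, a package such as caret offers helpers for stratified splitting, but the base R version keeps every step visible.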