- Mastering Machine Learning with R(Second Edition)
- Cory Lesmeister
- 156字
- 2021-07-09 18:23:53
Data preparation
Almost there! This step has the following five tasks:
- Selecting the data.
- Cleaning the data.
- Constructing the data.
- Integrating the data.
- Formatting the data.
These tasks are relatively self-explanatory. The goal is to get the data ready to input in the algorithms. This includes merging, feature engineering, and transformations. If imputation is needed, then it happens here as well. Additionally, with R, pay attention to how the outcome needs to be labeled. If your outcome/response variable is Yes/No, it may not work in some packages and will require a transformed or no variable with 1/0. At this point, you should also break your data into the various test sets if applicable: train, test, or validate. This step can be an unmitigated burden, but most experienced people will tell you that it is where you can separate yourself from your peers. With this, let's move on to the payoff, where you earn your money.
- 云數據中心基礎
- Oracle RAC 11g實戰指南
- 分布式數據庫系統:大數據時代新型數據庫技術(第3版)
- Learning Spring Boot
- 醫療大數據挖掘與可視化
- Python金融實戰
- “互聯網+”時代立體化計算機組
- 數據庫技術實用教程
- IPython Interactive Computing and Visualization Cookbook(Second Edition)
- 一本書講透Elasticsearch:原理、進階與工程實踐
- 聯動Oracle:設計思想、架構實現與AWR報告
- Gideros Mobile Game Development
- 數據庫應用系統技術
- MySQL技術內幕:InnoDB存儲引擎
- 數據賦能