- Mastering Machine Learning with R(Second Edition)
- Cory Lesmeister
- 156字
- 2021-07-09 18:23:53
Data preparation
Almost there! This step has the following five tasks:
- Selecting the data.
- Cleaning the data.
- Constructing the data.
- Integrating the data.
- Formatting the data.
These tasks are relatively self-explanatory. The goal is to get the data ready to input in the algorithms. This includes merging, feature engineering, and transformations. If imputation is needed, then it happens here as well. Additionally, with R, pay attention to how the outcome needs to be labeled. If your outcome/response variable is Yes/No, it may not work in some packages and will require a transformed or no variable with 1/0. At this point, you should also break your data into the various test sets if applicable: train, test, or validate. This step can be an unmitigated burden, but most experienced people will tell you that it is where you can separate yourself from your peers. With this, let's move on to the payoff, where you earn your money.
- 程序員修煉之道:從小工到專家
- Python數據分析入門:從數據獲取到可視化
- Voice Application Development for Android
- 大數據可視化
- MySQL基礎教程
- 大數據算法
- Access 2016數據庫技術及應用
- 深入淺出MySQL:數據庫開發、優化與管理維護(第2版)
- 大數據時代下的智能轉型進程精選(套裝共10冊)
- 大數據Hadoop 3.X分布式處理實戰
- 圖數據實戰:用圖思維和圖技術解決復雜問題
- 重復數據刪除技術:面向大數據管理的縮減技術
- MySQL數據庫實用教程
- Swift Functional Programming(Second Edition)
- Machine Learning for Mobile