- Mastering Predictive Analytics with R(Second Edition)
- James D. Miller Rui Miguel Forte
- 139字
- 2021-07-02 20:25:15
Getting started
Before we get started with discussing the process of tidying data, it would be very prudent to point out that whatever you do to tidy your data, you should be sure to:
- Create and save your scripts so that you can use them again for new or similar data sources. This is referred to as reusability. Why spend time recreating the same code, rules, or logic if you don't have to? This applies to new data within the same project (that the scripts were developed for) or new projects you may be involved with in the future.
- Tidy your data as "far upstream" as possible, perhaps even at the original source. In other words, save and maintain the original data, but use programmatic scripts to clean it, fix mistakes, and save that cleaned dataset for further analysis.
推薦閱讀
- Vue.js設(shè)計與實現(xiàn)
- CMDB分步構(gòu)建指南
- Getting Started with CreateJS
- 薛定宇教授大講堂(卷Ⅳ):MATLAB最優(yōu)化計算
- Java加密與解密的藝術(shù)
- Mathematica Data Analysis
- 數(shù)據(jù)結(jié)構(gòu)與算法分析(C++語言版)
- OpenStack Orchestration
- Learning Concurrency in Kotlin
- Web性能實戰(zhàn)
- 計算機應(yīng)用基礎(chǔ)項目化教程
- ASP.NET求職寶典
- Tableau Desktop可視化高級應(yīng)用
- 精益軟件開發(fā)管理之道
- C語言程序設(shè)計