- Mastering Predictive Analytics with R(Second Edition)
- James D. Miller Rui Miguel Forte
- 139字
- 2021-07-02 20:25:15
Getting started
Before we get started with discussing the process of tidying data, it would be very prudent to point out that whatever you do to tidy your data, you should be sure to:
- Create and save your scripts so that you can use them again for new or similar data sources. This is referred to as reusability. Why spend time recreating the same code, rules, or logic if you don't have to? This applies to new data within the same project (that the scripts were developed for) or new projects you may be involved with in the future.
- Tidy your data as "far upstream" as possible, perhaps even at the original source. In other words, save and maintain the original data, but use programmatic scripts to clean it, fix mistakes, and save that cleaned dataset for further analysis.
推薦閱讀
- INSTANT OpenCV Starter
- 編寫整潔的Python代碼(第2版)
- Python Tools for Visual Studio
- Practical DevOps
- 深入理解Java7:核心技術(shù)與最佳實踐
- MATLAB定量決策五大類問題
- Apache Mesos Essentials
- 基于Swift語言的iOS App 商業(yè)實戰(zhàn)教程
- 信息技術(shù)應用基礎
- Python Data Analysis Cookbook
- Extending Unity with Editor Scripting
- Python機器學習與量化投資
- Drupal Search Engine Optimization
- Groovy 2 Cookbook
- 小學生C++趣味編程從入門到精通