
Getting started

Before discussing the process of tidying data, it is worth pointing out that, whatever you do to tidy your data, you should be sure to:

  1. Create and save your scripts so that you can use them again for new or similar data sources. This is referred to as reusability. Why spend time recreating the same code, rules, or logic if you don't have to? This applies to new data within the same project (that the scripts were developed for) or new projects you may be involved with in the future.
  2. Tidy your data as "far upstream" as possible, perhaps even at the original source. In practice, this means saving and maintaining the original data, but using programmatic scripts to clean it, fix mistakes, and save that cleaned dataset for further analysis.
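
The two points above can be sketched in a few lines of Python. This is only an illustrative sketch, not a method taken from the text: the function names `tidy_rows` and `tidy_file`, the CSV format, and the specific cleaning rules (trimming whitespace, lowercasing column names, dropping empty rows) are all assumptions chosen for the example. The idea it shows is that the cleaning logic lives in a reusable function, and that the cleaned data is written to a separate file so the original is never modified.

```python
import csv
from pathlib import Path


def tidy_rows(rows):
    """Apply the same (hypothetical) cleaning rules to any iterable of dict rows."""
    cleaned = []
    for row in rows:
        # Lowercase and trim column names; trim values; treat missing values as "".
        tidy = {
            key.strip().lower(): (value or "").strip()
            for key, value in row.items()
            if key is not None  # ignore overflow fields that have no header
        }
        # Drop rows that are completely empty once whitespace is removed.
        if any(tidy.values()):
            cleaned.append(tidy)
    return cleaned


def tidy_file(raw_path, clean_path):
    """Read the raw CSV, clean it, and save the result to a separate file."""
    raw_path, clean_path = Path(raw_path), Path(clean_path)
    # Keep the original data intact: never write over the raw file.
    if raw_path.resolve() == clean_path.resolve():
        raise ValueError("refusing to overwrite the original data")
    with raw_path.open(newline="", encoding="utf-8") as src:
        cleaned = tidy_rows(csv.DictReader(src))
    if not cleaned:
        return 0
    with clean_path.open("w", newline="", encoding="utf-8") as dst:
        writer = csv.DictWriter(dst, fieldnames=list(cleaned[0].keys()))
        writer.writeheader()
        writer.writerows(cleaned)
    return len(cleaned)
```

Because the rules are kept in `tidy_rows`, the same script can be rerun when new data arrives for the same project, or adapted for a later project, by calling `tidy_file` with different (hypothetical) paths such as `tidy_file("raw/survey.csv", "clean/survey.csv")`.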