Data Munging

We are just getting into the action with data! In this chapter, you'll learn how to munge data. What does data munging mean?

The term mung is a technical term that was coined about half a century ago by students at the Massachusetts Institute of Technology (MIT). Munging means changing a piece of original data, through a series of well-specified and reversible steps, into a completely different (and hopefully more useful) one. Deep-rooted in hacker culture, munging is often referred to in the data science pipeline by other, almost synonymous, terms such as data wrangling or data preparation.

With that in mind, this chapter covers the following topics:

  • The data science process (so that you'll know what is going on and what's next)
  • Loading data from a file
  • Selecting the data you need
  • Cleaning up any missing or wrong data
  • Adding, inserting, and deleting data
  • Grouping and transforming data to obtain new and meaningful information
  • Managing to obtain a dataset matrix or an array to feed into the data science pipeline
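
To make the list above more concrete, here is a minimal sketch of what those steps might look like on a tiny dataset. It uses only the Python standard library, which is an assumption made for illustration and not necessarily the tooling used later in the chapter. The data, the column names, and the choice to treat a missing value as 0 are all hypothetical.

```python
import csv
import io
from collections import defaultdict

# Hypothetical raw data standing in for a file on disk; one "units" value is missing.
RAW_CSV = """city,product,units,price
Rome,apple,10,0.5
Rome,pear,,0.75
Milan,apple,4,0.25
Milan,pear,6,1.5
"""

# 1. Load: parse the CSV text into a list of row dictionaries.
rows = list(csv.DictReader(io.StringIO(RAW_CSV)))

# 2. Select: keep only the columns needed downstream.
rows = [{key: row[key] for key in ("city", "units", "price")} for row in rows]

# 3. Clean: treat a missing unit count as 0 (an illustrative choice)
#    and convert the text fields to numbers.
for row in rows:
    row["units"] = int(row["units"]) if row["units"] else 0
    row["price"] = float(row["price"])

# 4. Add: derive a new column from the existing ones.
for row in rows:
    row["revenue"] = row["units"] * row["price"]

# 5. Group and transform: total revenue per city.
revenue_by_city = defaultdict(float)
for row in rows:
    revenue_by_city[row["city"]] += row["revenue"]

# 6. Build a purely numeric matrix (a list of lists) for later pipeline stages.
matrix = [[row["units"], row["price"], row["revenue"]] for row in rows]
```

Each numbered comment corresponds to one of the topics in the list, so the sketch can serve as a rough map of the chapter rather than as a recommended implementation.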