官术网_书友最值得收藏!

Data manipulation, analysis, science, and pandas

We live in a world in which massive amounts of data are produced and stored every day. This data comes from a plethora of information systems, devices, and sensors. Almost everything you do, and items you use to do it, produces data which can be, or is, captured.

This has been greatly enabled by the ubiquitous nature of services that are connected to networks, and by the great increases in data storage facilities; this, combined with the ever-decreasing cost of storage, has made capturing and storing even the most trivial of data effective.

This has led to massive amounts of data being piled up and ready for access. But this data is spread out all over cyber-space, and is cannot actually be referred to as information. It tends to be a collected collection of the recording of events, whether financial, of your interactions with social networks, or of your personal health monitor tracking your heartbeat throughout the day. This data is stored in all kinds of formats, is located in scattered places, and beyond its raw nature does give much insight.

Logically, the overall process can be broken into three major areas of discipline:

  • Data manipulation
  • Data analysis
  • Data science

These three disciplines can and do have a lot of overlap. Where each ends and the others begin is open to interpretation. For the purposes of this book we will define each as in the following sections.

主站蜘蛛池模板: 阿鲁科尔沁旗| 乌拉特前旗| 安多县| 凯里市| 蒙城县| 罗甸县| 太保市| 河北省| 嵊州市| 合川市| 丘北县| 墨玉县| 永泰县| 蓝山县| 雷山县| 高陵县| 和林格尔县| 阿拉善右旗| 行唐县| 临猗县| 灵璧县| 交城县| 兴隆县| 宣威市| 梧州市| 铜陵市| 怀安县| 临夏市| 得荣县| 法库县| 资中县| 岢岚县| 扶绥县| 江城| 普兰店市| 赤水市| 莱西市| 图木舒克市| 上高县| 陇川县| 玛沁县|