官术网_书友最值得收藏!

Chapter 7. Data Analysis Application Examples

In this chapter, we want to get you acquainted with typical data preparation tasks and analysis techniques, because being fluent in preparing, grouping, and reshaping data is an important building block for successful data analysis.

While preparing data seems like a mundane task – and often it is – it is a step we cannot skip, although we can strive to simplify it by using tools such as Pandas.

Why is preparation necessary at all? Because most useful data will come from the real world and will have deficiencies, contain errors or will be fragmentary.

There are more reasons why data preparation is useful: it gets you in close contact with the raw material. Knowing your input helps you to spot potential errors early and build confidence in your results.

Here are a few data preparation scenarios:

  • A client hands you three files, each containing time series data about a single geological phenomenon, but the observed data is recorded on different intervals and uses different separators
  • A machine learning algorithm can only work with numeric data, but your input only contains text labels
  • You are handed the raw logs of a web server of an up and coming service and your task is to make suggestions on a growth strategy, based on existing visitor behavior
主站蜘蛛池模板: 贵南县| 吉安市| 怀安县| 临沂市| 博白县| 宜兰市| 海淀区| 苗栗县| 汶上县| 固始县| 遵义市| 巫溪县| 漯河市| 宝山区| 轮台县| 丰台区| 凯里市| 日照市| 昭觉县| 兰溪市| 沙湾县| 枣庄市| 长治县| 浏阳市| 海丰县| 北票市| 新乡市| 同心县| 金昌市| 大石桥市| 通许县| 浦县| 沂源县| 五河县| 鄂托克旗| 章丘市| 兴和县| 灵武市| 同仁县| 车险| 华容县|