官术网_书友最值得收藏!

Which Tool Should Be Used?

Seaborn tries to make the creation of some common analysis graphs easier than using Matplotlib directly. Matplotlib can be considered more low-level than Seaborn, and although this makes it a bit more cumbersome and verbose, it gives analysts much more flexibility. Some graphs, which with Seaborn are created with one function call, would take several lines of code to achieve using Matplotlib.

There is no rule to determine whether an analyst should use only the pandas plotting interface, Matplotlib directly, or Seaborn. Analysts should keep in mind the visualization requirements and the level of configuration required to create the desired graph.

Pandas' plotting interface is easier to use but is more constrained and limited. Seaborn has several graph patterns ready to use, including common statistical graphs such as pair plots and boxplots, but requires that the data is formatted into a tidy format and is more opinionated on how the graphs should look. Matplotlib is the base for both cases and is more flexible than both, but it demands a lot more code to create the same visualizations as the two other options.

The rule of thumb that we use in this book is: how can I create the graph that I need with the least amount of code and without changing the data? With that in mind, we will use the three options, sometimes at the same time, to attain our visualization goals. Analysts should not be restricted to just one of the options. We encourage the use of any tool that creates a meaningful visualization.

Let's go through the most common kinds of graphs used in statistical analysis.

主站蜘蛛池模板: 福贡县| 恩施市| 赫章县| 和静县| 和林格尔县| 肥城市| 雷山县| 沂南县| 莆田市| 成都市| 永宁县| 尉氏县| 云南省| 万载县| 宿州市| 济阳县| 美姑县| 哈密市| 施秉县| 红河县| 玉门市| 项城市| 秦皇岛市| 莱芜市| 赤水市| 宜宾县| 深水埗区| 陇南市| 夏津县| 临桂县| 吉木萨尔县| 龙山县| 商南县| 海伦市| 南通市| 五莲县| 兴文县| 永德县| 勐海县| 双峰县| 武川县|