官术网_书友最值得收藏!

  • Big Data Analysis with Python
  • Ivan Marin Ankit Shukla Sarang VK
  • 275字
  • 2021-06-11 13:46:40

Which Tool Should Be Used?

Seaborn tries to make the creation of some common analysis graphs easier than using Matplotlib directly. Matplotlib can be considered more low-level than Seaborn, and although this makes it a bit more cumbersome and verbose, it gives analysts much more flexibility. Some graphs, which with Seaborn are created with one function call, would take several lines of code to achieve using Matplotlib.

There is no rule to determine whether an analyst should use only the pandas plotting interface, Matplotlib directly, or Seaborn. Analysts should keep in mind the visualization requirements and the level of configuration required to create the desired graph.

Pandas' plotting interface is easier to use but is more constrained and limited. Seaborn has several graph patterns ready to use, including common statistical graphs such as pair plots and boxplots, but requires that the data is formatted into a tidy format and is more opinionated on how the graphs should look. Matplotlib is the base for both cases and is more flexible than both, but it demands a lot more code to create the same visualizations as the two other options.

The rule of thumb that we use in this book is: how can I create the graph that I need with the least amount of code and without changing the data? With that in mind, we will use the three options, sometimes at the same time, to attain our visualization goals. Analysts should not be restricted to just one of the options. We encourage the use of any tool that creates a meaningful visualization.

Let's go through the most common kinds of graphs used in statistical analysis.

主站蜘蛛池模板: 沙河市| 蓝田县| 神池县| 益阳市| 白城市| 德阳市| 汾阳市| 古田县| 湖北省| 大关县| 玉树县| 杭锦旗| 沂南县| 天峨县| 麦盖提县| 隆化县| 阿坝县| 岳池县| 仲巴县| 平昌县| 新化县| 泰安市| 东辽县| 肥乡县| 喜德县| 奉化市| 青海省| 巴南区| 江阴市| 和政县| 湖北省| 汪清县| 常宁市| 宁晋县| 五大连池市| 吴忠市| 长治县| 康保县| 东乌珠穆沁旗| 梁河县| 巨鹿县|