
Data visualization with machine learning

Let's get started with data visualization by plotting a histogram for each variable. The steps in the preceding section are important because we need to understand the dataset if we want to use machine learning accurately and effectively. Otherwise, we're shooting in the dark, and we might spend time on a method that isn't worth investigating. We will use plt (conventionally, matplotlib's pyplot module) to make a plot containing the histograms of our dataset, and we will increase the figure size to make them easier to read.
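
As a minimal sketch of this step, the following assumes the data is held in a pandas DataFrame named df. The small DataFrame built here is only a hypothetical stand-in for the dataset loaded in the preceding section: the column names, the 1 to 10 value range, and the class labels are illustrative assumptions, and the figure size of (10, 10) is just one reasonable choice.

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical stand-in for the DataFrame loaded in the preceding section.
# Column names and values are assumptions made for illustration only.
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "clump_thickness": rng.integers(1, 11, size=200),  # integers from 1 to 10
    "chromatin": rng.integers(1, 11, size=200),        # integers from 1 to 10
    "class": rng.choice([2, 4], size=200),             # two assumed class labels
})

# Draw one histogram per numeric column.
# figsize enlarges the whole figure so the individual panels are easier to read.
axes = df.hist(figsize=(10, 10))

plt.tight_layout()  # reduce overlap between panel titles and tick labels
plt.show()
```

With the real dataset in df, only the last three statements are needed. DataFrame.hist returns an array of matplotlib Axes, one per panel, so individual histograms can be adjusted afterward if required.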

We can see the output in the following screenshot:

As you can see, most of the preceding histograms have the majority of their data at around 1, with some data at slightly higher values. Each histogram, apart from class, has at least one case where the value is 10. The histogram for clump thickness is fairly evenly distributed, while the histogram for chromatin is concentrated at the low end of the scale, with a tail toward higher values (that is, it is right-skewed).
