官术网_书友最值得收藏!

Data visualization with machine learning

Let's get started with data visualization. We will plot histograms for each variable. The steps in the preceding section are important, because we need to understand these datasets if we want to accurately and effectively use machine learning. Otherwise, we're shooting in the dark, and we might spend time on a method that doesn't need to be investigated. We will use the plt method and make a plot, in which we will add the histograms of our dataset and edit the figure sizes, to make them easier to see.

We can see the output in the following screenshot:

As you can see, most of the preceding histograms have the majority of their data at around 1, with some data at a slightly higher value. Each histogram, apart from class, has at least one case where the value is 10. The histogram for clump thickness is pretty evenly distributed, while the histogram for chromatin is skewed to the left.

主站蜘蛛池模板: 黎平县| 宝山区| 东丰县| 化州市| 兴宁市| 湖北省| 南京市| 大宁县| 二连浩特市| 垦利县| 进贤县| 承德县| 嘉义县| 房山区| 南木林县| 荔浦县| 奉新县| 奉贤区| 三都| 大竹县| 无锡市| 大悟县| 古丈县| 河北区| 图们市| 甘泉县| 河北省| 宁国市| 子长县| 进贤县| 汤原县| 定南县| 武陟县| 玉溪市| 府谷县| 阿拉善左旗| 德清县| 宝应县| 鄂托克前旗| 霍林郭勒市| 化隆|