- Machine Learning for Healthcare Analytics Projects
- Eduonix Learning Solutions
- 253字
- 2021-06-24 18:21:42
Relationships between variables
We will now look at a scatterplot matrix, to see the relationships between some of these variables. A scatterplot matrix is a very useful function to use, because it can tell us whether a linear classifier will be a good classifier for our data, or whether we have to investigate more complicated methods.
We will add a scatter_matrix method and adjust the size to figsize(18, 18), to make it easier to see.
The output, as shown in the following screenshot, indicates the relationship between each variable and every other variable:
All of the variables are listed on both the x and the y axes. Where they intersect, we can see the histograms that we saw previously.
In the block indicated by the mouse cursor in the preceding screenshot, we can see that there is a pretty strong linear relationship between uniform_cell_shape and uniform_cell_size. This is expected. When we go through the preceding screenshot, we can see that some other cells have a good linear relationship. If we look at our classifications, however, there's no easy way to classify these relationships.
In class in the preceding screenshot, we can see that 4 is a malignant classification. We can also see that there are cells that are scored from 1 to 10 on clump_thickness, and were still classified as malignant.
Thus, we come to the conclusion that there aren't any strong relationships between any of the variables of our dataset.
- Linux Mint System Administrator’s Beginner's Guide
- 輕松學(xué)Java
- 圖解PLC控制系統(tǒng)梯形圖和語(yǔ)句表
- 21天學(xué)通Visual Basic
- 21天學(xué)通Java
- 自動(dòng)控制理論(非自動(dòng)化專(zhuān)業(yè))
- Learning C for Arduino
- Linux服務(wù)與安全管理
- 精通數(shù)據(jù)科學(xué):從線(xiàn)性回歸到深度學(xué)習(xí)
- Working with Linux:Quick Hacks for the Command Line
- Spatial Analytics with ArcGIS
- 從零開(kāi)始學(xué)JavaScript
- 網(wǎng)絡(luò)服務(wù)器搭建與管理
- 網(wǎng)絡(luò)信息安全項(xiàng)目教程
- ROS Robotics By Example(Second Edition)