- Machine Learning for Healthcare Analytics Projects
- Eduonix Learning Solutions
- 253字
- 2021-06-24 18:21:42
Relationships between variables
We will now look at a scatterplot matrix, to see the relationships between some of these variables. A scatterplot matrix is a very useful function to use, because it can tell us whether a linear classifier will be a good classifier for our data, or whether we have to investigate more complicated methods.
We will add a scatter_matrix method and adjust the size to figsize(18, 18), to make it easier to see.
The output, as shown in the following screenshot, indicates the relationship between each variable and every other variable:
All of the variables are listed on both the x and the y axes. Where they intersect, we can see the histograms that we saw previously.
In the block indicated by the mouse cursor in the preceding screenshot, we can see that there is a pretty strong linear relationship between uniform_cell_shape and uniform_cell_size. This is expected. When we go through the preceding screenshot, we can see that some other cells have a good linear relationship. If we look at our classifications, however, there's no easy way to classify these relationships.
In class in the preceding screenshot, we can see that 4 is a malignant classification. We can also see that there are cells that are scored from 1 to 10 on clump_thickness, and were still classified as malignant.
Thus, we come to the conclusion that there aren't any strong relationships between any of the variables of our dataset.
- AutoCAD繪圖實用速查通典
- Verilog HDL數(shù)字系統(tǒng)設(shè)計入門與應(yīng)用實例
- Java實用組件集
- 自動控制原理
- Learning Apache Cassandra(Second Edition)
- 人工智能工程化:應(yīng)用落地與中臺構(gòu)建
- 21天學(xué)通ASP.NET
- 21天學(xué)通C#
- AI 3.0
- 觸控顯示技術(shù)
- 基于32位ColdFire構(gòu)建嵌入式系統(tǒng)
- Mastering pfSense
- Hands-On Data Warehousing with Azure Data Factory
- 自動化生產(chǎn)線安裝與調(diào)試(三菱FX系列)(第二版)
- 3ds Max造型表現(xiàn)藝術(shù)