- Machine Learning for Healthcare Analytics Projects
- Eduonix Learning Solutions
- 253字
- 2021-06-24 18:21:42
Relationships between variables
We will now look at a scatterplot matrix, to see the relationships between some of these variables. A scatterplot matrix is a very useful function to use, because it can tell us whether a linear classifier will be a good classifier for our data, or whether we have to investigate more complicated methods.
We will add a scatter_matrix method and adjust the size to figsize(18, 18), to make it easier to see.
The output, as shown in the following screenshot, indicates the relationship between each variable and every other variable:
All of the variables are listed on both the x and the y axes. Where they intersect, we can see the histograms that we saw previously.
In the block indicated by the mouse cursor in the preceding screenshot, we can see that there is a pretty strong linear relationship between uniform_cell_shape and uniform_cell_size. This is expected. When we go through the preceding screenshot, we can see that some other cells have a good linear relationship. If we look at our classifications, however, there's no easy way to classify these relationships.
In class in the preceding screenshot, we can see that 4 is a malignant classification. We can also see that there are cells that are scored from 1 to 10 on clump_thickness, and were still classified as malignant.
Thus, we come to the conclusion that there aren't any strong relationships between any of the variables of our dataset.
- 模型制作
- Security Automation with Ansible 2
- 數據產品經理:解決方案與案例分析
- STM32G4入門與電機控制實戰:基于X-CUBE-MCSDK的無刷直流電機與永磁同步電機控制實現
- STM32嵌入式微控制器快速上手
- Maya極速引擎:材質篇
- Windows 7寶典
- CompTIA Network+ Certification Guide
- Cloudera Administration Handbook
- ESP8266 Home Automation Projects
- Blender 3D Printing by Example
- Red Hat Linux 9實務自學手冊
- Web編程基礎
- Photoshop CS4數碼照片處理入門、進階與提高
- Getting Started with Tableau 2019.2