官术网_书友最值得收藏!

Summary

In this chapter, we discussed how hierarchical clustering works and where it may be best employed. In particular, we discussed various aspects of how clusters can be subjectively chosen through the evaluation of a dendrogram plot. This is a huge advantage compared to k-means clustering if you have absolutely no idea of what you're looking for in the data. Two key parameters that drive the success of hierarchical clustering were also discussed: the agglomerative versus divisive approach and linkage criteria. Agglomerative clustering takes a bottom-up approach by recursively grouping nearby data together until it results in one large cluster. Divisive clustering takes a top-down approach by starting with the one large cluster and recursively breaking it down until each data point falls into its own cluster. Divisive clustering has the potential to be more accurate since it has a complete view of the data from the start; however, it adds a layer of complexity that can decrease the stability and increase the runtime.

Linkage criteria grapples with the concept of how distance is calculated between candidate clusters. We have explored how centroids can make an appearance again beyond k-means clustering, as well as single and complete linkage criteria. Single linkage finds cluster distances by comparing the closest points in each cluster, while complete linkage finds cluster distances by comparing more distant points in each cluster. From the understanding that you have gained in this chapter, you are now able to evaluate how both k-means and hierarchical clustering can best fit the challenge that you are working on. In the next chapter, we will cover a clustering approach that will serve us best in the highly complex data: DBSCAN (Density-Based Spatial Clustering of Applications with Noise).

主站蜘蛛池模板: 蒲江县| 衡阳县| 姚安县| 安西县| 东阳市| 罗甸县| 冀州市| 鄂州市| 阿拉尔市| 南昌县| 太原市| 兴和县| 敦化市| 西宁市| 河曲县| 胶南市| 西充县| 贺兰县| 平安县| 平遥县| 剑河县| 洪雅县| 莱西市| 天津市| 宁乡县| 陈巴尔虎旗| 宿州市| 舞阳县| 浦东新区| 宝山区| 英超| 安顺市| 威海市| 苗栗县| 蓝山县| 沧州市| 嵩明县| 九江市| 乐山市| 沁阳市| 瓮安县|