官术网_书友最值得收藏!

Clustering

Typically, when people talk about unsupervised learning, they talk about cluster analysis or clustering. A cluster analysis algorithm takes a set of data points and tries to categorize them into groups such that similar items belong to the same group, and different items do not. There are many ways where it can be used, for example, in customer segmentation or text categorization.

Customer segmentation is an example of clustering. Given some description of customers, we try to put them into groups such that the customers in one group have similar profiles and behave in a similar way. This information can be used to understand what do the people in these groups want, and this can be used to target them with better advertisements and other promotional messages.

Another example is text categorization. Given a collection of texts, we would like to find common topics among these texts and arrange the texts according to these topics. For example, given a set of complaints in an e-commerce store, we may want to put ones that talk about similar things together, and this should help the users of the system navigate through the complaints easier.

Examples of cluster analysis algorithms are hierarchical clustering, k-means, density-based spatial clustering of applications with noise (DBSCAN), and many others. We will talk about clustering in detail in the first part of Chapter 5, Unsupervised Learning - Clustering and Dimensionality Reduction.

主站蜘蛛池模板: 碌曲县| 阿坝县| 巫山县| 通河县| 长岭县| 达日县| 四川省| 泉州市| 西吉县| 台山市| 宣恩县| 鸡泽县| 广元市| 长兴县| 天全县| 密山市| 东乡族自治县| 惠州市| 木兰县| 志丹县| 伊金霍洛旗| 岐山县| 满洲里市| 色达县| 河南省| 靖安县| 平果县| 博湖县| 隆尧县| 安塞县| 台山市| 射阳县| 灌云县| 福鼎市| 托里县| 常德市| 鄯善县| 丰宁| 措勤县| 南江县| 黎川县|