官术网_书友最值得收藏!

Introduction

In the previous chapter, we saw how to build plots using the built-in function of pandas, and learned how to estimate the mean, median, and other descriptive statistics about specific consumer or product groups.

In this chapter, we will learn about clustering, a form of unsupervised learning technique, and then begin a discussion of how to calculate the similarity between two data points. Next, we will discuss how to standardize data so that multiple data features can be used without one overwhelming the others. We will also go through how similarity can be calculated by computing the distance between data points. Finally, we will discuss k-means clustering, how to perform it, and how to explore the resulting groups.

主站蜘蛛池模板: 大方县| 宁海县| 沐川县| 隆回县| 石河子市| 曲阜市| 肃北| 平江县| 内江市| 红河县| 仁寿县| 汝南县| 东兴市| 将乐县| 石台县| 福建省| 杭州市| 南安市| 根河市| 绿春县| 齐齐哈尔市| 江门市| 张家港市| 齐齐哈尔市| 报价| 沐川县| 安吉县| 桐梓县| 萨迦县| 墨竹工卡县| 兰考县| 南漳县| 麻阳| 永城市| 尚志市| 琼结县| 广灵县| 泉州市| 施秉县| 镶黄旗| 鸡泽县|