Introduction

In the previous chapter, we saw how to build plots using the built-in plotting functions of pandas, and learned how to calculate the mean, median, and other descriptive statistics for specific consumer or product groups.

In this chapter, we will learn about clustering, a type of unsupervised learning, and begin by discussing how to calculate the similarity between two data points. Next, we will discuss how to standardize data so that multiple features can be used together without one overwhelming the others. We will also see how similarity can be measured by computing the distance between data points. Finally, we will discuss k-means clustering: how to perform it and how to explore the resulting groups.
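As a small preview of why standardization matters when similarity is measured by distance, the following is a minimal sketch using only the Python standard library. The customer values and the helper names `euclidean` and `standardize` are hypothetical and chosen purely for illustration; they are not taken from the chapter's datasets or code.

```python
import math
import statistics

# Hypothetical customers: (annual income in dollars, age in years).
customers = [(40000, 25), (42000, 60), (90000, 27)]

def euclidean(a, b):
    """Straight-line (Euclidean) distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def standardize(rows):
    """Rescale each column to mean 0 and standard deviation 1 (z-scores)."""
    columns = list(zip(*rows))
    means = [statistics.mean(col) for col in columns]
    stdevs = [statistics.pstdev(col) for col in columns]
    return [
        tuple((value - m) / s for value, m, s in zip(row, means, stdevs))
        for row in rows
    ]

scaled = standardize(customers)

# On the raw data, income differences (thousands of dollars) swamp
# age differences (tens of years).
raw_d01 = euclidean(customers[0], customers[1])
raw_d02 = euclidean(customers[0], customers[2])

# After standardization, both features contribute on the same scale.
scaled_d01 = euclidean(scaled[0], scaled[1])
scaled_d02 = euclidean(scaled[0], scaled[2])

print("raw distances:   ", raw_d01, raw_d02)
print("scaled distances:", scaled_d01, scaled_d02)
```

On the raw values, the first customer looks far closer to the second than to the third, simply because income is measured in much larger units than age. After standardization, the 35-year age gap and the 50,000-dollar income gap carry comparable weight.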
