官术网_书友最值得收藏!

Unsupervised learning

Unsupervised learning is a form of machine learning in which the algorithm tries to detect/find patterns in data that do not have an outcome/target variable. In other words, we do not have data that comes with pre-existing labels. Thus, the algorithm will typically use a metric such as distance to group data together depending on how close they are to each other. 

As discussed in the previous section, most of the data that you will encounter in the real world will not come with a set of predefined labels and, as such, will only have a set of input features without a target attribute. 

In the following simple mathematical expression, U is the unsupervised learning algorithm, while X is a set of input features, such as weight and age:

Given this data, our objective is to create groups that could potentially be labeled as Healthy or Not Healthy. The unsupervised learning algorithm will use a metric such as distance in order to identify how close a set of points are to each other and how far apart two such groups are. The algorithm will then proceed to cluster these groups into two distinct groups, as illustrated in the following diagram:

Clustering two groups together 
主站蜘蛛池模板: 凤庆县| 大足县| 承德县| 海城市| 句容市| 滦南县| 抚宁县| 广南县| 五河县| 两当县| 许昌市| 彭水| 商城县| 康马县| 阿克| 石狮市| 和平县| 泽普县| 满洲里市| 顺昌县| 舒兰市| 望城县| 天柱县| 花垣县| 巨鹿县| 达日县| 金川县| 微博| 康保县| 库尔勒市| 隆尧县| 望谟县| 合肥市| 朝阳区| 扶绥县| 盐津县| 玉山县| 驻马店市| 青浦区| 西和县| 桂平市|