Unsupervised learning

Unsupervised learning is a form of machine learning in which the algorithm tries to find patterns in data that has no outcome or target variable. In other words, the data does not come with pre-existing labels. The algorithm therefore typically uses a metric such as distance to group data points according to how close they are to one another.
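To make the idea of a distance metric concrete, here is a minimal sketch that compares Euclidean distances between three records. The records, their (weight, age) values, and the variable names are hypothetical and chosen only for illustration; Euclidean distance is one common choice of metric, not the only one.

```python
import math

# Hypothetical (weight in kg, age in years) records; values are illustrative only.
person_a = (60.0, 25.0)
person_b = (62.0, 27.0)
person_c = (95.0, 60.0)

# Euclidean distance between two feature vectors (math.dist needs Python 3.8+).
d_ab = math.dist(person_a, person_b)
d_ac = math.dist(person_a, person_c)

# A distance-based algorithm would treat a and b as more similar than a and c.
print(d_ab < d_ac)
```

In practice, features measured on different scales are usually standardized before distances are computed, so that no single feature dominates the result.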

As discussed in the previous section, most of the data that you will encounter in the real world will not come with a set of predefined labels and, as such, will only have a set of input features without a target attribute. 

In the following simple mathematical expression, U is the unsupervised learning algorithm, while X is a set of input features, such as weight and age:

Given this data, our objective is to create groups that could potentially be labeled as Healthy or Not Healthy. The unsupervised learning algorithm uses a metric such as distance to determine how close the points within a group are to one another and how far apart the groups are. The algorithm then clusters the points into two distinct groups, as illustrated in the following diagram:

Clustering two groups together 
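The grouping described above can be sketched with k-means, one common distance-based clustering algorithm. The text does not name a specific algorithm, so k-means is an assumption here, as are the `k_means` helper, the sample rows, and the choice of initial centroids.

```python
import math

def k_means(points, centroids, max_iter=100):
    """Plain k-means: alternate assignment and centroid-update steps."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged: centroids no longer move
        centroids = new_centroids
    return labels, centroids

# Hypothetical (weight in kg, age in years) rows with no labels attached.
X = [(58, 22), (61, 25), (64, 28), (92, 55), (97, 61), (101, 58)]

# Seed with the first and last rows to keep the run deterministic.
labels, centroids = k_means(X, centroids=[X[0], X[-1]])
print(labels)
```

The cluster indices returned are arbitrary identifiers. The algorithm only separates the points into two groups; deciding whether a group corresponds to Healthy or Not Healthy is an interpretation made afterwards.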