官术网_书友最值得收藏!

Unsupervised learning

Unsupervised learning is a form of machine learning in which the algorithm tries to detect/find patterns in data that do not have an outcome/target variable. In other words, we do not have data that comes with pre-existing labels. Thus, the algorithm will typically use a metric such as distance to group data together depending on how close they are to each other. 

As discussed in the previous section, most of the data that you will encounter in the real world will not come with a set of predefined labels and, as such, will only have a set of input features without a target attribute. 

In the following simple mathematical expression, U is the unsupervised learning algorithm, while X is a set of input features, such as weight and age:

Given this data, our objective is to create groups that could potentially be labeled as Healthy or Not Healthy. The unsupervised learning algorithm will use a metric such as distance in order to identify how close a set of points are to each other and how far apart two such groups are. The algorithm will then proceed to cluster these groups into two distinct groups, as illustrated in the following diagram:

Clustering two groups together 
主站蜘蛛池模板: 杂多县| 衡东县| 剑阁县| 那坡县| 虎林市| 永兴县| 溆浦县| 南丰县| 西乡县| 石棉县| 汨罗市| 广平县| 巨鹿县| 大丰市| 息烽县| 神池县| 亚东县| 宁阳县| 聂拉木县| 唐海县| 凭祥市| 福清市| 南安市| 微山县| 无极县| 沽源县| 文水县| 兴化市| 玉田县| 中江县| 宜宾县| 衡南县| 阿拉善右旗| 马鞍山市| 盱眙县| 安多县| 静乐县| 常山县| 且末县| 明星| 星子县|