官术网_书友最值得收藏!

Understanding the k-NN algorithm

The k-NN algorithm is arguably one of the simplest machine learning algorithms. The reason for this is that we basically only need to store the training dataset. Then, in order to make a prediction for a new data point, we only need to find the closest data point in the training dataset-its nearest neighbor.

In a nutshell, the k-NN algorithm argues that a data point probably belongs to the same class as its neighbors. Think about it: if our neighbor is a Reds fan, we're probably Reds fans, too; otherwise we would have moved away a long time ago. The same can be said for the Blues.

Of course, some neighborhoods might be a little more complicated. In this case, we would not just consider our closest neighbor (where k=1), but instead our k nearest neighbors. To stick with our example as mentioned earlier, if we were Reds fans, we probably wouldn't move into a neighborhood where the majority of people are Blues fans.

That's all there is to it.

主站蜘蛛池模板: 永定县| 阿克陶县| 镇宁| 大港区| 屏边| 灵台县| 邵阳市| 台东县| 郧西县| 宽甸| 无为县| 搜索| 紫金县| 怀安县| 肥城市| 民和| 泰宁县| 康定县| 崇仁县| 昭通市| 彭阳县| 灵武市| 邓州市| 那坡县| 南平市| 莱州市| 普宁市| 南雄市| 固始县| 即墨市| 泸溪县| 仁寿县| 东莞市| 鄱阳县| 沈阳市| 汾西县| 石门县| 安顺市| 南岸区| 连云港市| 通州市|