官术网_书友最值得收藏!

KNN cons

  • The algorithm is fast for training but slow for inference.
  • You need to choose the best k somehow (see Choosing a good k section).
  • With the small values of k, the model can be badly affected by outliers; in other words, it's prone to overfitting.
  • You need to choose a distance metric. For usual real value features, one can choose among many available options (see Calculating the distance section) resulting in different closest neighbors. The metric used by default in many machine learning packages is the Euclidean distance; however, this choice is nothing more than a tradition and for many applications is not the optimal.
  • Model size grows with the new data incorporated.
  • What should we do if there are several identical samples with different labels? In this case, the result can be different depending on the order in which samples are stored.
  • The model suffers from the curse of dimensionality.
主站蜘蛛池模板: 甘谷县| 鱼台县| 固安县| 永济市| 大化| 建始县| 南陵县| 东辽县| 浑源县| 松桃| 蒙自县| 古丈县| 安丘市| 朝阳县| 科技| 射洪县| 通河县| 堆龙德庆县| 延长县| 凌海市| 虎林市| 新野县| 甘泉县| 任丘市| 霞浦县| 旌德县| 柞水县| 莱芜市| 茌平县| 阳山县| 安龙县| 榆社县| 吉林市| 扎鲁特旗| 合川市| 信丰县| 乳山市| 新绛县| 永仁县| 秦安县| 芦溪县|