官术网_书友最值得收藏!

  • Machine Learning in Java
  • AshishSingh Bhatia Bostjan Kaluza
  • 148字
  • 2021-06-10 19:29:58

The curse of dimensionality

The curse of dimensionality refers to a situation where we have a large number of features, often hundreds or thousands, which lead to an extremely large space with sparse data and, consequently, to distance anomalies. For instance, in high dimensions, almost all pairs of points are equally distant from each other; in fact, almost all of the pairs have distance close to the average distance. Another manifestation of the curse is that any two vectors are almost orthogonal, which means all of the angles are close to 90 degrees. This practically makes any distance measurement useless.

A cure for the curse of dimensionality might be found in one of the data reduction techniques, where we want to reduce the number of features; for instance, we can run a feature selection algorithm, such as ReliefF, or a feature extraction or reduction algorithm, such as PCA.

主站蜘蛛池模板: 文水县| 鄂托克旗| 吉安县| 铅山县| 罗甸县| 和硕县| 广灵县| 宜兴市| 莆田市| 东兰县| 石渠县| 台南市| 碌曲县| 萨嘎县| 扶绥县| 犍为县| 罗平县| 九龙坡区| 西林县| 婺源县| 茌平县| 涡阳县| 城市| 包头市| 都匀市| 丹阳市| 乃东县| 衡阳市| 赤峰市| 红安县| 杭锦后旗| 平昌县| 尤溪县| 梁平县| 三江| 章丘市| 珠海市| 房山区| 峨眉山市| 沁水县| 慈溪市|