官术网_书友最值得收藏!

Dimensionality reduction

Dimensionality reduction is used to reduce the dimensionality of a dataset. It is really helpful in cases where the problem becomes intractable, when the number of variables increases. By using the term dimensionality, we are referring to the features. One of the basic reduction techniques is feature engineering.

Generally, we have many dimensionality reduction algorithms:

  • Low variance filter: Dropping variables that have low variance, compared to others.
  • High correlation filter: This identifies the variables with high correlation, by using pearson or polychoric, and selects one of them using the Variance Inflation Factor (VIF).
  • Backward feature elimination: This is done by computing the sum of square of error (SSE) after eliminating each variable n times.
  • Linear Discriminant Analysis (LDA): This reduces the number of dimensions, n, from the original to the number of classes?—?1 number of features.
  • Principal Component Analysis (PCA): This is a statistical procedure that transforms variables into a new set of variables (principle components).
主站蜘蛛池模板: 南京市| 革吉县| 韩城市| 宁陵县| 桓台县| 定陶县| 深水埗区| 收藏| 长武县| 五莲县| 土默特左旗| 鄂托克旗| 丰顺县| 普兰县| 东城区| 淮阳县| 都兰县| 寿宁县| 万盛区| 大同市| 长垣县| 隆化县| 濮阳市| 荆门市| 南投市| 沁源县| 郎溪县| 嘉善县| 竹山县| 吴川市| 临澧县| 禹州市| 古丈县| 上林县| 金平| 呼图壁县| 南召县| 建湖县| 额敏县| 华阴市| 濮阳县|