官术网_书友最值得收藏!

  • Deep Learning By Example
  • Ahmed Menshawy
  • 176字
  • 2021-06-24 18:52:45

Feature transformations

In the previous two sections, we covered reading the train and test sets and combining them. We also handled some missing values. Now, we will use the random forest classifier of scikit-learn to predict the survival of passengers. Different implementations of the random forest algorithm accept different types of data. The scikit-learn implementation of random forest accepts only numeric data. So, we need to transform the categorical features into numerical ones.

There are two types of features:

  • Quantitative: Quantitative features are measured in a numerical scale and can be meaningfully sorted. In the Titanic data samples, the Age feature is an example of a quantitative feature.

  • Qualitative: Qualitative variables, also called categorical variables, are variables that are not numerical. They describe data that fits into categories. In the Titanic data samples, the Embarked (indicates the name of the departure port) feature is an example of a qualitative feature.

We can apply different kinds of transformations to different variables. The following are some approaches that one can use to transform qualitative/categorical features.

主站蜘蛛池模板: 永德县| 台北县| 云南省| 广水市| 宁安市| 新郑市| 红河县| 苗栗县| 吉木萨尔县| 蒲江县| 青海省| 东辽县| 金华市| 新沂市| 高尔夫| 莫力| 恭城| 威宁| 甘孜县| 宜阳县| 安图县| 溧水县| 昌吉市| 措美县| 大埔县| 宝山区| 麻江县| 项城市| 满城县| 寻甸| 荥阳市| 河北省| 吉首市| 文安县| 郧西县| 龙胜| 桐柏县| 板桥市| 勃利县| 庆云县| 三台县|