官术网_书友最值得收藏!

  • Deep Learning By Example
  • Ahmed Menshawy
  • 176字
  • 2021-06-24 18:52:45

Feature transformations

In the previous two sections, we covered reading the train and test sets and combining them. We also handled some missing values. Now, we will use the random forest classifier of scikit-learn to predict the survival of passengers. Different implementations of the random forest algorithm accept different types of data. The scikit-learn implementation of random forest accepts only numeric data. So, we need to transform the categorical features into numerical ones.

There are two types of features:

  • Quantitative: Quantitative features are measured in a numerical scale and can be meaningfully sorted. In the Titanic data samples, the Age feature is an example of a quantitative feature.

  • Qualitative: Qualitative variables, also called categorical variables, are variables that are not numerical. They describe data that fits into categories. In the Titanic data samples, the Embarked (indicates the name of the departure port) feature is an example of a qualitative feature.

We can apply different kinds of transformations to different variables. The following are some approaches that one can use to transform qualitative/categorical features.

主站蜘蛛池模板: 班玛县| 元江| 嘉善县| 石城县| 内丘县| 新郑市| 阿坝县| 凯里市| 远安县| 罗甸县| 松溪县| 西充县| 林周县| 祁连县| 翼城县| 土默特左旗| 大竹县| 曲松县| 仪征市| 琼结县| 佛山市| 承德市| 文安县| 镇康县| 五台县| 米脂县| 博野县| 贺州市| 罗城| 临颍县| 阳江市| 阳城县| 通辽市| 淳安县| 松滋市| 宾阳县| 彭水| 酒泉市| 靖宇县| 微博| 喀喇沁旗|