官术网_书友最值得收藏!

Data-Driven Feature Engineering

The previous section dealt with business-driven feature engineering. In addition to features we can derive from the business perspective, it would also be imperative to transform data through feature engineering from the perspective of data structures. We will look into different methods of identifying data structures and take a peek into some data transformation techniques.

A Quick Peek at Data Types and a Descriptive Summary

Looking at the data types such as categorical or numeric and then deriving summary statistics is a good way to take a quick peek into data before you do some of the downstream feature engineering steps. Let's take a look at an example from our dataset:

# Looking at Data types

print(bankData.dtypes)

# Looking at descriptive statistics

print(bankData.describe())

You should get the following output:

Figure 3.28: Output showing the different data types in the dataset

In the preceding output, you see the different types of information in the dataset and its corresponding data types. For instance, age is an integer and so is day.

The following output is that of a descriptive summary statistic, which displays some of the basic measures such as mean, standard deviation, count, and the quantile values of the respective features:

Figure 3.29: Data types and a descriptive summary

The purpose of a descriptive summary is to get a quick feel of the data with respect to the distribution and some basic statistics such as mean and standard deviation. Getting a perspective on the summary statistics is critical for thinking about what kind of transformations are required for each variable.

For instance, in the earlier exercises, we converted the numerical data into categorical variables based on the quantile values. Intuitions for transforming variables would come from the quick summary statistics that we can derive from the dataset.

In the following sections, we will be looking at the correlation matrix and visualization.

主站蜘蛛池模板: 那坡县| 汕头市| 郸城县| 凤翔县| 萨迦县| 阳新县| 广宗县| 特克斯县| 遵化市| 永仁县| 琼海市| 明溪县| 黑水县| 常熟市| 聂拉木县| 井冈山市| 苗栗市| 兰溪市| 大方县| 阿尔山市| 罗田县| 华蓥市| 阿克苏市| 莆田市| 汝南县| 渝中区| 平昌县| 平潭县| 仙游县| 合肥市| 厦门市| 锦州市| 宜兴市| 南昌县| 定州市| 云林县| 洱源县| 虞城县| 灌南县| 肥西县| 天水市|