
Loading the data

This is the same data that we used in Chapter 2, Linear Regression. I've prepared it exactly as before; the only difference is that the features and response are saved as an RData file. You can download it from GitHub: https://github.com/PacktPublishing/Advanced-Machine-Learning-with-R/blob/master/Data/amesDL.RData.

Once you have that in your working directory, load it into the environment:

> load("amesDL.RData")

Notice that you now have four new objects:

  • trained: The training data features
  • tested: The testing data features
  • train_logy: The log of the home sale price for the training data
  • test_logy: The log of the home sale price for the testing data
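
If you want to confirm what an RData file contains before relying on it, one option is to load it into a separate environment and inspect the result. The following is only a sketch: the objects toy_features and toy_response and the temporary file are hypothetical stand-ins, not the contents of amesDL.RData. It relies on the fact that load() invisibly returns the names of the objects it restores:

```r
# Hypothetical stand-ins for objects saved in an RData file
toy_features <- data.frame(x = 1:3)
toy_response <- c(0.1, 0.2, 0.3)

# Save them to a temporary file so the example is self-contained
tmp <- tempfile(fileext = ".RData")
save(toy_features, toy_response, file = tmp)

# Load into a separate environment so nothing in the workspace is overwritten
e <- new.env()
loaded <- load(tmp, envir = e)

# Names of the objects that were restored
loaded

# Class of each restored object
sapply(loaded, function(nm) class(get(nm, envir = e)))
```

With the real file, the same pattern would be load("amesDL.RData", envir = e), after which loaded should list the four objects described above.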

It is essential that the data is centered and scaled for a neural network (in the prior exercise, all features were either zero or one, which is acceptable). To perform this task, we can use the preProcess() function in the caret package. Let's use the training data to compute the mean and standard deviation of each feature, which we will then apply to both the training and test data:

> prep <- caret::preProcess(trained, method = c("center", "scale"))

> trainT <- predict(prep, trained)

This gives us our transformed training data. However, Keras will not accept a dataframe as an input. It needs an array for both the features and the response. This is an easy fix with the data.matrix() function:

> train_logy <- data.matrix(train_logy)

> trainT <- data.matrix(trainT)
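
To see what the conversion does, here is a small sketch on toy data. The names toy_df and toy_y are hypothetical, and the response is assumed to be a plain numeric vector, which may not match how train_logy is actually stored:

```r
# Hypothetical toy features and response
toy_df <- data.frame(a = c(1.5, 2.5, 3.5), b = c(0L, 1L, 0L))
toy_y  <- c(11.8, 12.1, 12.4)

# A data frame becomes a numeric matrix with the same shape
toy_x <- data.matrix(toy_df)
is.matrix(toy_x)
dim(toy_x)

# A plain vector becomes a one-column matrix
toy_y <- data.matrix(toy_y)
dim(toy_y)
```

Be aware that data.matrix() converts factor columns to their internal integer codes, so it is best applied to features that are already numeric.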

Now, you can just repeat these steps with the test data features:

> testT <- predict(prep, tested)

> testT <- data.matrix(testT)
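
The reason for reusing prep on the test data, rather than fitting a new transformation, is that the test features must be shifted and scaled by the training means and standard deviations. The following base R sketch illustrates the idea on toy data. The values and the toy_ names are hypothetical, and caret's internals may differ in detail:

```r
# Toy data standing in for the real features (hypothetical values)
toy_train <- data.frame(x1 = c(1, 2, 3, 4), x2 = c(10, 20, 30, 40))
toy_test  <- data.frame(x1 = c(2, 5), x2 = c(15, 50))

# Learn the centering and scaling values from the training data only
mu    <- colMeans(toy_train)
sigma <- apply(toy_train, 2, sd)

# Apply the same values to both sets; scale() returns a matrix
toy_trainT <- scale(toy_train, center = mu, scale = sigma)
toy_testT  <- scale(toy_test,  center = mu, scale = sigma)

# Training columns now have mean 0 and standard deviation 1
round(colMeans(toy_trainT), 10)
apply(toy_trainT, 2, sd)

# Test columns generally do not, and that is expected
colMeans(toy_testT)
```

Computing separate statistics on the test set would put the two sets on different scales and would leak information about the test data into the preprocessing.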

It's about to get interesting.
