官术网_书友最值得收藏!

Getting ready

We will use the os package in the operating system's dependent functionality, and the pandas package for data manipulation. 

Let's now take a look at the data definitions to understand our variables. In the following code, we list the data definition for a few variables. The dataset and the complete data definitions are available on GitHub. Here is an abridged version of the data description file:

MS SubClass (Nominal): Identifies the type of dwelling involved in the sale
Lot Frontage (Continuous): Linear feet of street connected to property
Alley (Nominal): Type of alley access to property
Overall Qual (Ordinal): Rates the overall material and finish of the house
Overall Cond (Ordinal): Rates the overall condition of the house
Year Built (Discrete): Original construction date
Mas Vnr Type (Nominal): Masonry veneer type
Mas Vnr Area (Continuous): Masonry veneer area in square feet
Garage Type (Nominal): Garage location
Garage Yr Blt (Discrete): Year garage was built
Garage Finish (Ordinal): Interior finish of the garage
Garage Cars (Discrete): Size of garage in car capacity
Garage Area (Continuous): Size of garage in square feet
Garage Qual (Ordinal): Garage quality
Garage Cond (Ordinal): Garage condition
...
...
SalePrice (Continuous): Sale price $$

We will then import the os and pandas packages and set our working directory according to our requirements, as seen in the following code block:

import os
import pandas as pd

# Set working directory as per your need
os.chdir(".../.../Chapter 1")
os.getcwd()

The next step is to download the dataset from GitHub and copy it to your working directory. 

主站蜘蛛池模板: 镇安县| 江都市| 宝丰县| 闵行区| 德保县| 商南县| 军事| 平潭县| 台前县| 五家渠市| 庆安县| 万载县| 无为县| 盱眙县| 天台县| 绥化市| 安图县| 太谷县| 昌吉市| 柳江县| 南召县| 松江区| 湟中县| 建阳市| 墨江| 大余县| 辽中县| 万山特区| 济南市| 吉木乃县| 九龙坡区| 山阴县| 桓台县| 玛纳斯县| 桐城市| 吉隆县| 仪征市| 博罗县| 鞍山市| 英山县| 同仁县|