官术网_书友最值得收藏!

Making sense of data

It is crucial to identify the type of data under analysis. In this section, we are going to learn about different types of data that you can encounter during analysis. Different disciplines store different kinds of data for different purposes. For example, medical researchers store patients' data, universities store students' and teachers' data, and real estate industries storehouse and building datasets. A dataset contains many observations about a particular object. For instance, a dataset about patients in a hospital can contain many observations. A patient can be described by a patient identifier (ID), name, address, weight, date of birth, address, email, and gender. Each of these features that describes a patient is a variable. Each observation can have a specific value for each of these variables. For example, a patient can have the following:

PATIENT_ID = 1001
Name = Yoshmi Mukhiya
Address = Mannsverk 61, 5094, Bergen, Norway
Date of birth = 10th July 2018
Email = yoshmimukhiya@gmail.com
Weight = 10
Gender = Female

These datasets are stored in hospitals and are presented for analysis. Most of this data is stored in some sort of database management system in tables/schema. An example of a table for storing patient information is shown here:

            
PATIENT_ID           NAME           ADDRESS           DOB           EMAIL           Gender           WEIGHT
001           Suresh Kumar Mukhiya           Mannsverk, 61           30.12.1989           skmu@hvl.no           Male           68
002           Yoshmi Mukhiya           Mannsverk 61, 5094, Bergen           10.07.2018           yoshmimukhiya@gmail.com           Female           1
003           Anju Mukhiya           Mannsverk 61, 5094, Bergen           10.12.1997           anjumukhiya@gmail.com           Female           24
004           Asha Gaire           Butwal, Nepal           30.11.1990           aasha.gaire@gmail.com           Female           23
005           Ola Nordmann           Danmark, Sweden           12.12.1789           ola@gmail.com           Male           75

 

To summarize the preceding table, there are four observations (001, 002, 003, 004, 005). Each observation describes variables (PatientID, name, address, dob, email, gender, and weight). Most of the dataset broadly falls into two groups—numerical data and categorical data. 

主站蜘蛛池模板: 綦江县| 凉城县| 顺昌县| 呼和浩特市| 新河县| 竹山县| 仲巴县| 阿图什市| 和顺县| 安远县| 安远县| 张家港市| 望城县| 望城县| 常德市| 诸暨市| 延吉市| 白水县| 天气| 木里| 富宁县| 益阳市| 章丘市| 玉屏| 保山市| 虞城县| 义乌市| 阜康市| 广水市| 关岭| 确山县| 黄骅市| 安庆市| 潞西市| 阳高县| 花垣县| 措美县| 迁安市| 齐齐哈尔市| 丰都县| 襄汾县|