- Feature Engineering Made Easy
- Sinan Ozdemir Divya Susarla
- 319字
- 2021-06-25 22:45:53
Quantitative versus qualitative data
To accomplish our diagnoses of the various types of data, we will begin with the highest order of separation. When dealing with structured, tabular data (which we usually will be doing), the first question we generally ask ourselves is whether the values are of a numeric or categorical nature.
Quantitative data are data that are numerical in nature. They should be measuring the quantity of something.
Qualitative data are data that are categorical in nature. They should be describing the quality of something.
Basic examples:
- Weather measured as temperature in Fahrenheit or Celsius would be quantitative
- Weather measured as cloudy or sunny would be qualitative
- The name of a person visiting the White House would be qualitative
- The amount of blood you donate at a blood drive is quantitative
The first two examples show that we can describe similar systems using data from both the qualitative and quantitative side. In fact, in most datasets, we will be working with both qualitative and quantitative data.
Sometimes, data can, arguably, be either quantitative or qualitative. The ranking you would give a restaurant (one through five stars) could be considered quantitative or qualitative, for example. While they are numbers, the numbers themselves might also represent categories. For example, if the restaurant rating app asked you to rate the restaurant using a quantitative star system, then feasibly the restaurant's average ranking might be a decimal, like 4.71 stars, making the data quantitative. At the same time, if the app asked you if you hated it, thought it was OK, liked it, loved it, or really loved it, then these are now categories. As a result of these ambiguities between quantitative and qualitative data, we employ an even deeper method called the four levels of data. Before we do that, let's introduce our first dataset for the chapter and really solidify some examples of qualitative and quantitative data.
- 數(shù)據(jù)產(chǎn)品經(jīng)理高效學習手冊:產(chǎn)品設計、技術常識與機器學習
- App+軟件+游戲+網(wǎng)站界面設計教程
- Learning Spring Boot
- 文本數(shù)據(jù)挖掘:基于R語言
- 數(shù)據(jù)庫系統(tǒng)原理及應用教程(第4版)
- 智能數(shù)據(jù)分析:入門、實戰(zhàn)與平臺構建
- 基于Apache CXF構建SOA應用
- 數(shù)據(jù)庫程序員面試筆試真題庫
- 數(shù)據(jù)庫技術及應用教程
- 圖數(shù)據(jù)實戰(zhàn):用圖思維和圖技術解決復雜問題
- 重復數(shù)據(jù)刪除技術:面向大數(shù)據(jù)管理的縮減技術
- 貫通SQL Server 2008數(shù)據(jù)庫系統(tǒng)開發(fā)
- 爬蟲實戰(zhàn):從數(shù)據(jù)到產(chǎn)品
- 大數(shù)據(jù)與機器學習:實踐方法與行業(yè)案例
- Access 2016數(shù)據(jù)庫應用基礎