- Statistics for Data Science
- James D. Miller
- 2021-07-02 14:58:52
Categorical data
Earlier, we explained how variables in your data can be either independent or dependent. Variables can also be categorical. A categorical variable can take on one of a limited, and typically fixed, number of possible values, so each individual is assigned to a particular category.
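To make the definition concrete, here is a minimal Python sketch of a variable restricted to a fixed set of values. The `PetType` categories and the owner names are hypothetical, chosen only for illustration, and an `Enum` is just one of several ways to model a categorical variable.

```python
from enum import Enum


class PetType(Enum):
    """A hypothetical categorical variable with a fixed set of values."""
    DOG = "dog"
    CAT = "cat"
    FISH = "fish"


# Each individual (hypothetical names) is assigned to exactly one category.
preferences = {
    "Alice": PetType.DOG,
    "Bob": PetType.CAT,
    "Carol": PetType.DOG,
}

# The full set of allowed values is known in advance.
allowed_values = [pet.value for pet in PetType]

# A value outside the fixed set is rejected.
try:
    PetType("dragon")
    rejected = False
except ValueError:
    rejected = True
```

The point of the sketch is that the set of possible values is limited and known up front, which is what separates a categorical variable from a free-form one.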
Often, the meaning of collected data is unclear. Categorizing the data is one method a data scientist can use to give it meaning.
For example, suppose a numeric variable is collected and the values found are 4, 10, and 12. The meaning of these values becomes clear once they are categorized. Suppose that, based on an analysis of how the data was collected, we determine that the data describes university students and can be grouped (or categorized) by sport, giving the following numbers of players:
- 4 tennis players
- 10 soccer players
- 12 football players
Now, because we grouped the data into categories, the meaning becomes clear.
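The grouping above can be sketched in a few lines of Python. This is an illustration only: the `records` list is a hypothetical reconstruction of what the raw data might look like (one entry per student), built to match the counts given in the text.

```python
from collections import Counter

# Hypothetical raw data: one record per university student, naming the sport played.
records = ["tennis"] * 4 + ["soccer"] * 10 + ["football"] * 12

# Group the records by category and count the members of each group.
counts = Counter(records)

for sport, n in counts.items():
    print(f"{n} {sport} players")
```

The bare values 4, 10, and 12 only become meaningful once each count is attached to its category label.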
Other examples of categorized data include individual pet preferences (grouped by type of pet) and vehicle ownership (grouped by style of car owned).
So, categorical data, as the name suggests, is data grouped into one or more categories. Some data scientists refer to categories as sub-populations of data.