- Data Analysis with IBM SPSS Statistics
- Kenneth Stehlik Barry Anthony J. Babinec
- 287字
- 2021-07-02 18:13:50
Discovering coding issues using frequencies
The frequency distribution for INCOME in the following screenshot demonstrates another reason why it is important to examine the pattern for individual data fields before diving into analytics more deeply. Navigate to Analyze | Descriptive Statistics | Frequencies, and select Respondents Income to build this table:

People are often reluctant to divulge their income so surveys typically ask them to select an income category like the groupings in this table. Notice, however, that the groups (numbered 1 through 26) represent unequal bands of income. The groups coded 3 thru 7 represent a range of only $1,000, while group 15 has a range of $5,000, and group 20 covers $15,000.
In the previous chapter, one of the properties of variables that was discussed involved the level of measurement. Income is an example of a scale variable and for these fields, it is assumed that the intervals between values are equal, which is not the case in this instance. The information in this table can be evaluated even though the categories are not uniform in terms of the dollar range they represent, but before using this field for statistical modeling, it is important to adjust the coding to reflect the actual differences in income.
In a later chapter, techniques for modifying fields will be covered but it is by examining the data along with the coding scheme used that you discover where the need for change exists.
- AngularJS深度剖析與最佳實踐
- Koa開發:入門、進階與實戰
- Building a Recommendation Engine with Scala
- 基于Swift語言的iOS App 商業實戰教程
- Mathematica Data Analysis
- 可解釋機器學習:模型、方法與實踐
- 用戶體驗增長:數字化·智能化·綠色化
- 飛槳PaddlePaddle深度學習實戰
- RabbitMQ Cookbook
- Swift語言實戰精講
- Go語言精進之路:從新手到高手的編程思想、方法和技巧(1)
- Scala Reactive Programming
- Python大學實用教程
- Flink技術內幕:架構設計與實現原理
- 現代C:概念剖析和編程實踐