- Java Data Analysis
- John R. Hubbard
- 212字
- 2021-07-02 18:21:46
Summary
This chapter discussed various organizational processes used to prepare data for analysis. When used in computer programs, each data value is assigned a data type, which characterizes the data and defines the kind of operations that can be performed upon it.
When stored in a relational database, data is organized into tables, in which each row corresponds to one data point, and where all the data in each column corresponds to a single field of a specified type. The key field(s) has unique values, which allows indexed searching.
A similar viewpoint is the organization of data into key-value pairs. As in relational database tables, the key fields must be unique. A hash table implements the key-value paradigm with a hash function that determines where the key's associated data is stored.
Data files are formatted according to their file type's specifications. The comma-separated value type (CSV) is one of the most common. Common structured data file types include XML and JSON.
The information that describes the structure of the data is called its metadata. That information is required for the automatic processing of the data.
Specific data processes described here include data cleaning and filtering (removing erroneous data), data scaling (adjusting numeric values according to a specified scale), sorting, merging, and hashing.
- TypeScript入門與實戰
- SoapUI Cookbook
- Mastering RabbitMQ
- INSTANT Weka How-to
- C語言程序設計
- TypeScript圖形渲染實戰:基于WebGL的3D架構與實現
- 微信小程序入門指南
- 現代C++編程實戰:132個核心技巧示例(原書第2版)
- 計算機應用基礎項目化教程
- SQL Server 2008中文版項目教程(第3版)
- Node.js區塊鏈開發
- Hacking Android
- IPython Interactive Computing and Visualization Cookbook
- Robot Framework Test Automation
- Python第三方庫開發應用實戰