- Feature Engineering Made Easy
- Sinan Ozdemir Divya Susarla
- 150字
- 2021-06-25 22:45:53
The structure, or lack thereof, of data
When given a new dataset, it is first important to recognize whether or not your data is structured or unstructured:
Structured (organized) data: Data that can be broken down into observations and characteristics. They are generally organized using a tabular method (where rows are observations and columns are characteristics).
Unstructured (unorganized) data: Data that exists as a free-flowing entity and does not follow standard organizational hierarchy such as tabularity. Often, unstructured data appears to us as a blob of data, or as a single characteristic (column).
A few examples that highlight the difference between structured and unstructured data are as follows:
Data that exists in a raw free-text form, including server logs and tweets, are unstructured
Meteorological data, as reported by scientific instruments in precise movements, would be considered highly structured as they exist in a tabular row/column structure
- SQL入門經典(第5版)
- Word 2010中文版完全自學手冊
- Access 2007數據庫應用上機指導與練習
- 大數據:規劃、實施、運維
- 大數據算法
- R數據科學實戰:工具詳解與案例分析(鮮讀版)
- Mastering Machine Learning with R(Second Edition)
- Oracle 12c云數據庫備份與恢復技術
- SQL優化最佳實踐:構建高效率Oracle數據庫的方法與技巧
- Unreal Engine Virtual Reality Quick Start Guide
- MySQL數據庫技術與應用
- Expert Python Programming(Third Edition)
- The Natural Language Processing Workshop
- Internet of Things with Python
- SQL Server 2008寶典(第2版)