官术网_书友最值得收藏!

The structure, or lack thereof, of data

When given a new dataset, it is first important to recognize whether or not your data is structured or unstructured:

  • Structured (organized) data: Data that can be broken down into observations and characteristics. They are generally organized using a tabular method (where rows are observations and columns are characteristics).

  • Unstructured (unorganized) data: Data that exists as a free-flowing entity and does not follow standard organizational hierarchy such as tabularity. Often, unstructured data appears to us as a blob of data, or as a single characteristic (column).

A few examples that highlight the difference between structured and unstructured data are as follows:

  • Data that exists in a raw free-text form, including server logs and tweets, are unstructured

  • Meteorological data, as reported by scientific instruments in precise movements, would be considered highly structured as they exist in a tabular row/column structure

主站蜘蛛池模板: 五台县| 八宿县| 孟连| 莎车县| 通山县| 大厂| 南华县| 建水县| 正宁县| 广元市| 凯里市| 特克斯县| 内江市| 南皮县| 南木林县| 巴南区| 沐川县| 孟津县| 根河市| 什邡市| 阿拉善右旗| 桂林市| 固始县| 邮箱| 神农架林区| 肇州县| 上犹县| 扶余县| 衡山县| 精河县| 泰州市| 邓州市| 华安县| 嘉义市| 宽甸| 清涧县| 财经| 营山县| 贵定县| 正镶白旗| 胶南市|