官术网_书友最值得收藏!

Semi-structured

Semi-structured data refers to data that has both the elements of an organizational schema as well as aspects that are arbitrary. A personal phone diary (increasingly rare these days!) with columns for name, address, phone number, and notes could be considered a semi-structured dataset. The user might not be aware of the addresses of all inpiduals and hence some of the entries may have just a phone number and vice versa.

Similarly, the column for notes may contain additional descriptive information (such as a facsimile number, name of a relative associated with the inpidual, and so on). It is an arbitrary field that allows the user to add complementary information. The columns for name, address, and phone number can thus be considered structured in the sense that they can be presented in a tabular format, whereas the notes section is unstructured in the sense that it may contain an arbitrary set of descriptive information that cannot be represented in the other columns in the diary.

In computing, semi-structured data is usually represented by formats, such as JSON, that can encapsulate both structured as well as schemaless or arbitrary associations, generally using key-value pairs. A more common example could be email messages, which have both a structured part, such as name of the sender, time when the message was received, and so on, that is common to all email messages and an unstructured portion represented by the body or content of the email.

Platforms such as Mongo and CouchDB are generally used to store and query semi-structured datasets.

主站蜘蛛池模板: 平罗县| 邯郸县| 濉溪县| 海阳市| 安吉县| 革吉县| 麻江县| 资阳市| 宿松县| 富蕴县| 安远县| 松溪县| 霸州市| 犍为县| 平潭县| 外汇| 兰州市| 平阴县| 凌云县| 孟津县| 固阳县| 东莞市| 集安市| 淳化县| 渝北区| 夏河县| 涟水县| 平江县| 时尚| 当阳市| 密云县| 翼城县| 农安县| 获嘉县| 全南县| 吉水县| 五莲县| 大冶市| 石渠县| 湄潭县| 日照市|