官术网_书友最值得收藏!

Semi-structured

Semi-structured data refers to data that has both the elements of an organizational schema as well as aspects that are arbitrary. A personal phone diary (increasingly rare these days!) with columns for name, address, phone number, and notes could be considered a semi-structured dataset. The user might not be aware of the addresses of all inpiduals and hence some of the entries may have just a phone number and vice versa.

Similarly, the column for notes may contain additional descriptive information (such as a facsimile number, name of a relative associated with the inpidual, and so on). It is an arbitrary field that allows the user to add complementary information. The columns for name, address, and phone number can thus be considered structured in the sense that they can be presented in a tabular format, whereas the notes section is unstructured in the sense that it may contain an arbitrary set of descriptive information that cannot be represented in the other columns in the diary.

In computing, semi-structured data is usually represented by formats, such as JSON, that can encapsulate both structured as well as schemaless or arbitrary associations, generally using key-value pairs. A more common example could be email messages, which have both a structured part, such as name of the sender, time when the message was received, and so on, that is common to all email messages and an unstructured portion represented by the body or content of the email.

Platforms such as Mongo and CouchDB are generally used to store and query semi-structured datasets.

主站蜘蛛池模板: 赤峰市| 临沧市| 台中市| 安塞县| 苏尼特右旗| 巴楚县| 松原市| 永和县| 涞源县| 望江县| 宁远县| 正宁县| 南木林县| 民丰县| 那坡县| 峨眉山市| 阿坝| 清涧县| 肥乡县| 开鲁县| 巴青县| 西峡县| 班玛县| 临高县| 普兰店市| 新密市| 永安市| 营口市| 泉州市| 塔城市| 遂平县| 依安县| 安徽省| 西乌| 阳朔县| 依安县| 玉屏| 五莲县| 四会市| 湖北省| 长白|