官术网_书友最值得收藏!

Introduction to big data modeling

Having a good idea of what big data and its characteristics are, let's now dig into what big data modeling is. Say we have the dataset, which we classify as big data, and before doing any analysis on the dataset, we need to have an idea of how the data looks. The goal of data modeling is to formally explore the nature of data so that you can figure out what kind of storage you need, and what kind of processing you can do on it.

Data modeling is a technique that helps to give meaningful insight into data by defining and categorizing it, and establishing official definitions and descriptors so that the data can be utilized by all information systems in a company.

We can hold at least two primary reasons for performing data modeling:

  • Strategic data modeling facilitates the overall information systems development strategy
  • Data modeling can help in the development of new databases

The data modeling for strategic outlining suggests defining what kind of data you will need for your company processes, while modeling in the context of analysis is more focused on representing data that exists and finding ways to classify it. In the case of big data, that process probably requires finding similarities between data from disparate sources and confirming that they, in fact, describe the same thing. In either case, the end goal is to generate a representation of your data that can be replicated in your database architecture.

主站蜘蛛池模板: 将乐县| 台中市| 扶风县| 泰安市| 扎赉特旗| 思南县| 罗田县| 宁蒗| 潜江市| 镇雄县| 南投县| 新蔡县| 平顺县| 堆龙德庆县| 五家渠市| 巴中市| 马公市| 普兰县| 富蕴县| 宁国市| 汉川市| 禄丰县| 塔城市| 襄汾县| 承德县| 凤城市| 慈溪市| 泽库县| 汝南县| 苏州市| 永定县| 和龙市| 仪征市| 扶绥县| 四子王旗| 巍山| 安国市| 晋江市| 中山市| 会宁县| 依兰县|