官术网_书友最值得收藏!

Building blocks of big data analytics

At a fundamental level, big data systems can be considered to have four major layers, each of which are indispensable. There are many such layers that are outlined in various textbooks and literature and, as such, it can be ambiguous. Nevertheless, at a high level, the layers defined here are both intuitive and simplistic:

Big Data Analytics Layers

The levels are broken down as follows:

  • Hardware: Servers that provide the computing backbone, storage devices that store the data, and network connectivity across different server components are some of the elements that define the hardware stack. In essence, the systems that provide the computational and storage capabilities and systems that support the interoperability of these devices form the foundational layer of the building blocks.
  • Software: Software resources that facilitate analytics on the datasets hosted in the hardware layer, such as Hadoop and NoSQL systems, represent the next level in the big data stack. Analytics software can be classified into various subpisions. Two of the primary high-level classifications for analytics software are tools that facilitate are:
    • Data mining: Software that provides facilities for aggregations, joins across datasets, and pivot tables on large datasets fall into this category. Standard NoSQL platforms such as Cassandra, Redis, and others are high-level, data mining tools for big data analytics.
    • Statistical analytics: Platforms that provide analytics capabilities beyond simple data mining, such as running algorithms that can range from simple regressions to advanced neural networks such as Google TensorFlow or R, fall into this category.
  • Data management: Data encryption, governance, access, compliance, and other features salient to any enterprise and production environment to manage and, in some ways, reduce operational complexity form the next basic layer. Although they are less tangible than hardware or software, data management tools provide a defined framework, using which organizations can fulfill their obligations such as security and compliance.
  • End user: The end user of the analytics software forms the final aspect of a big data analytics engagement. A data platform, after all, is only as good as the extent to which it can be leveraged efficiently and addresses business-specific use cases. This is where the role of the practitioner who makes use of the analytics platform to derive value comes into play. The term data scientist is often used to denote inpiduals who implement the underlying big data analytics capabilities while business users reap the benefits of faster access and analytics capabilities not available in traditional systems.
主站蜘蛛池模板: 徐汇区| 枞阳县| 库尔勒市| 辽中县| 分宜县| 太仓市| 黄浦区| 彩票| 房产| 宁陕县| 内江市| 增城市| 崇阳县| 方正县| 仙桃市| 额敏县| 谢通门县| 砚山县| 会泽县| 封丘县| 乌鲁木齐县| 北辰区| 会同县| 台南县| 新干县| 苍溪县| 公主岭市| 大姚县| 温宿县| 邵武市| 如皋市| 广水市| 罗江县| 女性| 曲周县| 三门峡市| 伊吾县| 调兵山市| 慈利县| 宁陕县| 隆林|