官术网_书友最值得收藏!

Apache Storm

Apache Storm has emerged as the platform of choice for industry leaders to develop distributed, real-time, data processing platforms. It provides a set of primitives that can be used to develop applications that can process a very large amount of data in real time in a highly scalable manner.

Storm is to real-time processing what Hadoop is to batch processing. It is open source software, and managed by Apache Software Foundation. It has been deployed to meet real-time processing needs by companies such as Twitter, Yahoo!, and Flipboard. Storm was first developed by Nathan Marz at BackType, a company that provided social search applications. Later, BackType was acquired by Twitter, and it is a critical part of their infrastructure. Storm can be used for the following use cases:

  • Stream processing: Storm is used to process a stream of data and update a variety of databases in real time. This processing occurs in real time and the processing speed needs to match the input data speed.
  • Continuous computation: Storm can do continuous computation on data streams and stream the results to clients in real time. This might require processing each message as it comes in or creating small batches over a short time. An example of continuous computation is streaming trending topics on Twitter into browsers.
  • Distributed RPC: Storm can parallelize an intense query so that you can compute it in real time.
  • Real-time analytics: Storm can analyze and respond to data that comes from different data sources as they happen in real time.

In this chapter, we will cover the following topics:

  • What is a Storm?
  • Features of Storm
  • Architecture and components of a Storm cluster
  • Terminologies of Storm
  • Programming language
  • Operation modes
主站蜘蛛池模板: 黄大仙区| 福海县| 达日县| 黑龙江省| 隆安县| 大宁县| 潮安县| 家居| 扎鲁特旗| 蓝山县| 西安市| 长丰县| 潍坊市| 于都县| 镇平县| 北碚区| 大厂| 新民市| 海阳市| 南阳市| 祁东县| 郸城县| 榆林市| 乌苏市| 册亨县| 安岳县| 泸定县| 隆回县| 双流县| 溆浦县| 平阳县| 三原县| 桂林市| 阿拉善盟| 哈密市| 沙洋县| 津南区| 高州市| 贡山| 高安市| 玛多县|