官术网_书友最值得收藏!

Apache Storm

Apache Storm has emerged as the platform of choice for industry leaders to develop distributed, real-time, data processing platforms. It provides a set of primitives that can be used to develop applications that can process a very large amount of data in real time in a highly scalable manner.

Storm is to real-time processing what Hadoop is to batch processing. It is open source software, and managed by Apache Software Foundation. It has been deployed to meet real-time processing needs by companies such as Twitter, Yahoo!, and Flipboard. Storm was first developed by Nathan Marz at BackType, a company that provided social search applications. Later, BackType was acquired by Twitter, and it is a critical part of their infrastructure. Storm can be used for the following use cases:

  • Stream processing: Storm is used to process a stream of data and update a variety of databases in real time. This processing occurs in real time and the processing speed needs to match the input data speed.
  • Continuous computation: Storm can do continuous computation on data streams and stream the results to clients in real time. This might require processing each message as it comes in or creating small batches over a short time. An example of continuous computation is streaming trending topics on Twitter into browsers.
  • Distributed RPC: Storm can parallelize an intense query so that you can compute it in real time.
  • Real-time analytics: Storm can analyze and respond to data that comes from different data sources as they happen in real time.

In this chapter, we will cover the following topics:

  • What is a Storm?
  • Features of Storm
  • Architecture and components of a Storm cluster
  • Terminologies of Storm
  • Programming language
  • Operation modes
主站蜘蛛池模板: 彝良县| 苍山县| 绥化市| 洞口县| 荔波县| 宜兰县| 和田县| 岢岚县| 工布江达县| 麻栗坡县| 邻水| 喜德县| 罗田县| 定兴县| 安顺市| 邮箱| 友谊县| 泰安市| 太白县| 凤台县| 扎囊县| 陇西县| 米林县| 尚义县| 盘锦市| 葫芦岛市| 蓝山县| 顺平县| 额尔古纳市| 临湘市| 西华县| 秭归县| 上犹县| 库伦旗| 泰来县| 甘肃省| 柳州市| 芒康县| 增城市| 涿州市| 中卫市|