
ETL patterns

ETL (extract, transform, load) is another area of computing that lends itself well to serverless platforms. At a high level, an ETL job comprises the following three steps:

  • Extracting data from one data source
  • Transforming that data appropriately
  • Loading the processed data into another data source

ETL jobs are common in analytics and data warehousing, and they are hard to avoid. Because these jobs are ephemeral, and because users typically want them to finish as quickly as possible, serverless systems are a good fit for this problem space. Although serverless computation is typically short-lived, we will see how ETL processes can be designed to be long-running in order to work through large amounts of data.
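
The three ETL steps can be sketched in a few lines of Python. This is a minimal illustration rather than a production design: the CSV text, the `payments` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for the destination data source.

```python
import csv
import io
import sqlite3

# Hypothetical sample input standing in for a real source system.
RAW_CSV = "name,amount\nalice,10.5\nbob,3\n"


def extract(raw_text):
    """Extract: read rows from the source (here, CSV text)."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(rows):
    """Transform: tidy up names and convert amounts to integer cents."""
    return [
        (row["name"].strip().title(), round(float(row["amount"]) * 100))
        for row in rows
    ]


def load(records, connection):
    """Load: write the processed records into the destination table."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS payments (name TEXT, cents INTEGER)"
    )
    connection.executemany("INSERT INTO payments VALUES (?, ?)", records)
    connection.commit()


def run_etl(raw_text, connection):
    """Chain the three steps in order."""
    load(transform(extract(raw_text)), connection)


connection = sqlite3.connect(":memory:")
run_etl(RAW_CSV, connection)
loaded = connection.execute(
    "SELECT name, cents FROM payments ORDER BY name"
).fetchall()
```

In a serverless setting, each step would more likely read from and write to managed storage, but the extract, transform, load ordering stays the same.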

In the fan-out pattern, discussed in Chapter 5, Scaling Out with the Fan-Out Pattern, a single unit of work is broken up into multiple smaller units of work that are processed in parallel. This pattern may be used as a standalone system or as a subcomponent of a more extensive system. We'll build an application using the fan-out pattern in isolation, and later discuss how it can work as one piece of a more extensive system.
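
To make the idea concrete, here is a small local sketch of fanning out work. It assumes a thread pool as a stand-in for parallel serverless function invocations, and the helper names (`split_into_chunks`, `process_chunk`, `fan_out`) are hypothetical; the point is only to show one unit of work being split into pieces that are processed independently.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(items, chunk_size):
    """Break one large unit of work into smaller units."""
    return [items[i:i + chunk_size] for i in range(0, len(items), chunk_size)]


def process_chunk(chunk):
    """Hypothetical worker: sum the squares of the numbers in one chunk."""
    return sum(n * n for n in chunk)


def fan_out(items, chunk_size=3):
    """Process every chunk in parallel and collect the partial results."""
    chunks = split_into_chunks(items, chunk_size)
    # One worker per chunk; in a serverless system each chunk would
    # typically be handed to its own function invocation instead.
    with ThreadPoolExecutor(max_workers=len(chunks) or 1) as pool:
        return list(pool.map(process_chunk, chunks))


partial_sums = fan_out(list(range(10)))
total = sum(partial_sums)
```

Because `pool.map` returns results in the order of the input chunks, the partial results can be combined without any extra bookkeeping.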

Messaging patterns could form an entire class of design patterns by themselves. In our context, we will show how to use messaging as a general pattern to process data asynchronously with a known or fixed amount of processing power. Chapter 6, Asynchronous Processing with the Messaging Pattern, will walk through a full example of this pattern and its variants in a serverless context.
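
As a rough local analogy, the sketch below processes messages asynchronously with a fixed number of workers. It assumes an in-process `queue.Queue` in place of a managed message queue and threads in place of serverless functions; `run_workers` and the uppercase "processing" step are hypothetical placeholders.

```python
import queue
import threading


def run_workers(messages, worker_count=2):
    """Process messages from a queue using a fixed number of workers."""
    inbox = queue.Queue()
    results = []
    results_lock = threading.Lock()

    def worker():
        while True:
            message = inbox.get()
            if message is None:  # sentinel: no more work for this worker
                inbox.task_done()
                break
            with results_lock:
                results.append(message.upper())  # placeholder processing
            inbox.task_done()

    threads = [threading.Thread(target=worker) for _ in range(worker_count)]
    for thread in threads:
        thread.start()

    # Producers only enqueue messages; they never wait on a specific worker.
    for message in messages:
        inbox.put(message)
    for _ in threads:
        inbox.put(None)

    inbox.join()
    for thread in threads:
        thread.join()
    return results


processed = run_workers(["a", "b", "c"])
```

The number of workers, not the number of messages, bounds how much processing happens at once, which is the property the messaging pattern relies on. Completion order is not guaranteed, so callers should not depend on the order of `processed`.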
