- JavaScript Cloud Native Development Cookbook
- John Gilbert
- 306字
- 2021-07-16 18:03:29
How it works...
The most important characteristic of a data lake is that it stores data in perpetuity. The only way to really meet this requirement is to use object storage, such as AWS S3. S3 provides 11 nines of durability. Said another way, S3 provides 99.999999999% durability of objects over a given year. It is also fully managed and provides life cycle management features to age objects into cold storage. Note that the bucket is defined with the DeletionPolicy set to Retain. This highlights that even if the stack is deleted, we still want to ensure that we are not inappropriately deleting this valuable data.
We are using Kinesis Firehose because it performs the heavy lifting of writing the events to the bucket. It provides a buffer based on the time and size, compression, encryption, and error handling. To simplify this recipe, I did not use compression or encryption, but it is recommended that you use these features.
This recipe defines one delivery stream, because in this cookbook, our stream topology consists of only one stream with ${cf:cncb-event-stream-${opt:stage}.streamArn}. In practice, your topology will consist of multiple streams, and you will define one Firehose delivery stream per Kinesis stream to ensure that the data lake is capturing all events. We set prefix to ${cf:cncb-event-stream-${opt:stage}.streamName}/ so that we can easily distinguish the events in the data lake by their stream.
Another important characteristic of a data lake is that the data is stored in its raw format, without modification. To this end, the transformer function adorns all available metadata about the specific Kinesis stream and Firehose delivery stream, to ensure that all available information is collected. In the Replaying events recipe, we will see how this metadata can be leveraged. Also, note that transformer adds the end-of-line character (\n) to facilitate future processing of the data.
- Premiere Pro與After Effects從入門到實(shí)戰(zhàn)(微課版)
- 數(shù)字圖像密碼算法詳解:基于C、C#與MATLAB
- Android底層開(kāi)發(fā)技術(shù)實(shí)戰(zhàn)詳解
- 移動(dòng)通信技術(shù)與網(wǎng)絡(luò)優(yōu)化(第2版)
- 快修巧修新型等離子和高清晰度電視機(jī)
- 手繪圖說(shuō)萬(wàn)用表使用
- EPON/GPON技術(shù)問(wèn)答
- 廈華/海爾新型彩色電視機(jī)速修圖解
- Instant BrainShark
- 芯片設(shè)計(jì)——CMOS模擬集成電路版圖設(shè)計(jì)與驗(yàn)證:基于Cadence IC 6.1.7(第2版)
- 電子電路知識(shí)與實(shí)踐課堂
- 5G新時(shí)代與邊緣計(jì)算
- 高可靠性電子產(chǎn)品工藝設(shè)計(jì)及案例分析
- LTE-V2X測(cè)試與仿真從入門到精通
- 通信系統(tǒng)導(dǎo)論