
Summary

In this chapter, we explained what a data stream is, gave related examples, and looked at real-time use cases for data streams. We introduced the setup and quick execution of several real-time data ingestion tools: Flume, NiFi, Logstash, and Fluentd. We also explained where these tools stand in terms of reliability and scalability. We then compared them, weighing their pros and cons, so that the reader can pick the tool that best fits their use case. The examples can be run easily, in standalone as well as cluster mode, using the code bundled as a JAR. Finally, we gave the reader a real-time problem to solve using data ingestion tools, along with pseudocode, so that the focus could stay on coding the example rather than on finding the right solution.

Now that we are aware of the different types of data streaming tools, the next chapter focuses on setting up Storm. Storm is an open source, distributed, resilient, real-time processing engine. Setting it up includes downloading, installing, and configuring it, and then running an example to test whether the setup works.
