- Mastering Spark for Data Science
- Andrew Morgan Antoine Amend David George Matthew Hallett
- 86字
- 2021-07-09 18:49:33
Summary
In this chapter, we walked through the full setup of an Apache NiFi GDELT ingest pipeline, complete with metadata forks and a brief introduction to visualizing the resulting data. This section is particularly important as GDELT is used extensively throughout the book and the NiFi method is a highly effective way to source data in a scalable and modular way.
In the next chapter, we will get to grips with what to do with the data once it's landed, by looking at schemas and formats.
推薦閱讀
- 實(shí)時(shí)流計(jì)算系統(tǒng)設(shè)計(jì)與實(shí)現(xiàn)
- 協(xié)作機(jī)器人技術(shù)及應(yīng)用
- Photoshop CS4經(jīng)典380例
- Java開(kāi)發(fā)技術(shù)全程指南
- 快學(xué)Flash動(dòng)畫百例
- 自主研拋機(jī)器人技術(shù)
- CompTIA Network+ Certification Guide
- Photoshop CS3圖層、通道、蒙版深度剖析寶典
- 基于ARM 32位高速嵌入式微控制器
- 大數(shù)據(jù)技術(shù)基礎(chǔ):基于Hadoop與Spark
- SMS 2003部署與操作深入指南
- FANUC工業(yè)機(jī)器人配置與編程技術(shù)
- 筆記本電腦維修之電路分析基礎(chǔ)
- Getting Started with Tableau 2018.x
- PostgreSQL High Performance Cookbook