- Pentaho Data Integration Beginner's Guide(Second Edition)
- María Carina Roldán
- 150字
- 2021-07-23 15:47:00
Chapter 5. Controlling the Flow of Data
In the previous chapters, you learned to transform your data in many ways. Now suppose you collect results from a survey. You receive several files with the data and those files have different formats. You have to merge those files somehow, and generate a unified view of the information. Not only that, you want to remove the rows of data whose content is irrelevant. Finally, based on the rows that interest you, you want to create another file with some statistics. This kind of requirement is very common, but requires more background in PDI.
In this chapter, you will learn how to implement this kind of task with Kettle. In particular, we will cover the following topics:
- Copying and distributing rows
- Splitting the stream based on conditions
- Merging streams
You will also apply these concepts in the treatment of invalid data.
推薦閱讀
- 計算機應用
- 面向STEM的mBlock智能機器人創新課程
- Learning Microsoft Azure Storage
- PowerShell 3.0 Advanced Administration Handbook
- Mobile DevOps
- Dreamweaver CS3網頁設計與網站建設詳解
- 大數據挑戰與NoSQL數據庫技術
- 新編計算機組裝與維修
- 大數據驅動的機械裝備智能運維理論及應用
- 多媒體制作與應用
- 青少年VEX IQ機器人實訓課程(初級)
- Building Google Cloud Platform Solutions
- Hands-On DevOps
- Deep Learning Essentials
- Learning Couchbase