官术网_书友最值得收藏!

Chapter 5. Controlling the Flow of Data

In the previous chapters, you learned to transform your data in many ways. Now suppose you collect results from a survey. You receive several files with the data and those files have different formats. You have to merge those files somehow, and generate a unified view of the information. Not only that, you want to remove the rows of data whose content is irrelevant. Finally, based on the rows that interest you, you want to create another file with some statistics. This kind of requirement is very common, but requires more background in PDI.

In this chapter, you will learn how to implement this kind of task with Kettle. In particular, we will cover the following topics:

  • Copying and distributing rows
  • Splitting the stream based on conditions
  • Merging streams

You will also apply these concepts in the treatment of invalid data.

主站蜘蛛池模板: 永寿县| 上饶市| 江永县| 龙井市| 交城县| 灌南县| 高密市| 奉新县| 长阳| 贺兰县| 江津市| 霍林郭勒市| 东丰县| 高陵县| 冷水江市| 苏州市| 甘谷县| 财经| 琼结县| 宜良县| 中超| 稷山县| 阜宁县| 东海县| 榆树市| 东兰县| 广灵县| 界首市| 施甸县| 新营市| 林甸县| 桃源县| 曲松县| 忻城县| 星座| 敦化市| 磴口县| 贺兰县| 库车县| 南涧| 白沙|