
Chapter 5. Controlling the Flow of Data

In the previous chapters, you learned to transform your data in many ways. Now suppose you collect results from a survey. You receive several files with the data, and those files have different formats. You have to merge them somehow and generate a unified view of the information. Not only that, you also want to remove the rows of data whose content is irrelevant. Finally, based on the rows that interest you, you want to create another file with some statistics. This kind of requirement is very common, but meeting it requires more background in PDI.

In this chapter, you will learn how to implement this kind of task with Kettle. In particular, we will cover the following topics:

  • Copying and distributing rows
  • Splitting the stream based on conditions
  • Merging streams

You will also apply these concepts to the treatment of invalid data.
