官术网_书友最值得收藏!

Finding the sum of all even numbers in an RDD of integers

Let's first create an RDD of integers as follows:

scala>valintRDD = sc.parallelize(Array(1,4,5,6,7,10,15))
intRDD: org.apache.spark.rdd.RDD[Int] = ParallelCollectionRDD[0] at parallelize at <console>:24

The next step is to filter all the even elements in this RDD. So, we will execute a filter operation on the RDD, as follows:

scala>valevenNumbersRDD=intRDD.filter(i => (i%2==0))
evenNumbersRDD: org.apache.spark.rdd.RDD[Int] = MapPartitionsRDD[3] at filter at <console>:26

The preceding operation will fetch those elements for which 2 is a factor, that is, even elements .

Now, we will sum all the elements of this RDD as follows:

scala>val sum =evenNumbersRDD.sum
sum: Double = 20.0
scala> sum
res5: Double = 20.0

It is shown in the following screenshot:

主站蜘蛛池模板: 安化县| 林周县| 奈曼旗| 胶州市| 商河县| 北宁市| 房山区| 蓝田县| 锦屏县| 河南省| 黑河市| 阿合奇县| 静宁县| 舒城县| 江阴市| 洱源县| 政和县| 德钦县| 民丰县| 崇义县| 台江县| 高尔夫| 杭州市| 禹州市| 贵州省| 鹿泉市| 合肥市| 新乡县| 青海省| 通州区| 安新县| 虎林市| 全南县| 行唐县| 甘泉县| 施甸县| 白河县| 邻水| 阳西县| 攀枝花市| 永昌县|