官术网_书友最值得收藏!

Finding the sum of all even numbers in an RDD of integers

Let's first create an RDD of integers as follows:

scala>valintRDD = sc.parallelize(Array(1,4,5,6,7,10,15))
intRDD: org.apache.spark.rdd.RDD[Int] = ParallelCollectionRDD[0] at parallelize at <console>:24

The next step is to filter all the even elements in this RDD. So, we will execute a filter operation on the RDD, as follows:

scala>valevenNumbersRDD=intRDD.filter(i => (i%2==0))
evenNumbersRDD: org.apache.spark.rdd.RDD[Int] = MapPartitionsRDD[3] at filter at <console>:26

The preceding operation will fetch those elements for which 2 is a factor, that is, even elements .

Now, we will sum all the elements of this RDD as follows:

scala>val sum =evenNumbersRDD.sum
sum: Double = 20.0
scala> sum
res5: Double = 20.0

It is shown in the following screenshot:

主站蜘蛛池模板: 贡嘎县| 沂源县| 浙江省| 奉贤区| 滨州市| 綦江县| 花莲市| 柏乡县| 饶河县| 韶山市| 茌平县| 木里| 奇台县| 成都市| 白银市| 鄂州市| 安远县| 扬中市| 达拉特旗| 济南市| 新闻| 钦州市| 无为县| 启东市| 巨野县| 南江县| 辉南县| 富顺县| 和平区| 东辽县| 米林县| 秦皇岛市| 永福县| 双柏县| 锡林浩特市| 鞍山市| 阿巴嘎旗| 含山县| 张家港市| 阿荣旗| 溧水县|