- Apache Spark 2.x for Java Developers
- Sourav Gulati Sumit Kumar
- 118字
- 2021-07-02 19:02:01
Finding the sum of all even numbers in an RDD of integers
Let's first create an RDD of integers as follows:
scala>valintRDD = sc.parallelize(Array(1,4,5,6,7,10,15))
intRDD: org.apache.spark.rdd.RDD[Int] = ParallelCollectionRDD[0] at parallelize at <console>:24
The next step is to filter all the even elements in this RDD. So, we will execute a filter operation on the RDD, as follows:
scala>valevenNumbersRDD=intRDD.filter(i => (i%2==0))
evenNumbersRDD: org.apache.spark.rdd.RDD[Int] = MapPartitionsRDD[3] at filter at <console>:26
The preceding operation will fetch those elements for which 2 is a factor, that is, even elements .
Now, we will sum all the elements of this RDD as follows:
scala>val sum =evenNumbersRDD.sum
sum: Double = 20.0
scala> sum
res5: Double = 20.0
It is shown in the following screenshot:

推薦閱讀
- Microsoft Exchange Server PowerShell Cookbook(Third Edition)
- PostgreSQL for Data Architects
- Rust實戰
- SpringMVC+MyBatis快速開發與項目實戰
- PHP+MySQL網站開發技術項目式教程(第2版)
- Java從入門到精通(第4版)
- 數據結構(Java語言描述)
- 匯編語言程序設計(第2版)
- Elasticsearch for Hadoop
- Mastering JavaScript Design Patterns(Second Edition)
- 從Java到Web程序設計教程
- Java面向對象程序設計
- Building Microservices with .NET Core
- C#面向對象程序設計(第2版)
- 計算機組裝與維護(第二版)