- Hands-On Big Data Analytics with PySpark
- Rudy Lai, Bartłomiej Potaczek
- 2021-06-24 15:52:34
Parallelization with Spark RDDs
Now that we know how to create an RDD from the text file that we downloaded from the internet, we can look at a different way to create one. Let's discuss parallelization with Spark RDDs.
In this section, we will cover the following topics:
- What is parallelization?
- How do we parallelize Spark RDDs?
Let's start with parallelization.