- Learning Apache Cassandra(Second Edition)
- Sandeep Yarabarla
- 144 words
- 2021-07-03 00:19:24
MapReduce and Spark
MapReduce is a technique for performing aggregate processing on large amounts of data in parallel; it is particularly common in data analytics applications. Cassandra does not offer built-in MapReduce capabilities, but it can be integrated with Hadoop to perform MapReduce operations across Cassandra data sets, or with Spark for real-time data analysis. DataStax Enterprise provides integration with both of these tools out of the box.
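To make the technique concrete, here is a minimal single-process sketch of the map, shuffle, and reduce phases in plain Python. It does not use Hadoop or Cassandra; the `rows` list, its column names, and all function names are hypothetical stand-ins for records that a real job would read from a Cassandra table and process in parallel across many nodes.

```python
from collections import defaultdict
from functools import reduce

# Hypothetical rows standing in for records read from a Cassandra table.
rows = [
    {"username": "alice", "status": "Learning Cassandra"},
    {"username": "bob", "status": "Trying out Spark"},
    {"username": "alice", "status": "Running a MapReduce job"},
]

def map_phase(row):
    """Map: emit one (username, 1) pair per status update."""
    yield (row["username"], 1)

def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single total."""
    return key, reduce(lambda a, b: a + b, values)

def map_reduce(records):
    """Run the three phases sequentially over an in-memory list."""
    pairs = (pair for record in records for pair in map_phase(record))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())

counts = map_reduce(rows)
print(counts)
```

In a distributed framework, the map and reduce functions have the same shape, but the framework runs many copies of each on different machines and handles the shuffle over the network.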
Spark is a fast, distributed, and expressive computational engine used for large-scale data processing, similar to MapReduce. It is typically much faster than Hadoop MapReduce, largely because it can keep intermediate data in memory rather than writing it to disk between stages, and it runs with resource managers such as Mesos and YARN. It can read data from various sources such as Hadoop or Cassandra, or even from streams such as Kafka. DataStax provides a Spark-Cassandra connector to load data from Cassandra into Spark and run batch computations on the data.