- Learning Apache Cassandra(Second Edition)
- Sandeep Yarabarla
- 144字
- 2021-07-03 00:19:24
MapReduce and Spark
MapReduce is a technique for performing aggregate processing on large amounts of data in parallel; it's a particularly common technique in data analytics applications. Cassandra does not offer built-in MapReduce capabilities, but it can be integrated with Hadoop in order to perform MapReduce operations across Cassandra data sets, or Spark for real-time data analysis. The DataStax enterprise product provides integration with both of these tools out of the box.
Spark is a fast, distributed, and expressive computational engine used for large-scale data processing similar to MapReduce. It is much more efficient than MapReduce and runs with resource managers such as Mesos and Yarn. It can read data from various sources such as Hadoop or Cassandra or even streams such as Kafka. DataStax provides a Spark-Cassandra connector to load data from Cassandra into Spark and run batch computations on the data.
- 電氣自動化專業英語(第3版)
- 火格局的時空變異及其在電網防火中的應用
- Mastercam 2017數控加工自動編程經典實例(第4版)
- SCRATCH與機器人
- 程序設計語言與編譯
- PostgreSQL 10 Administration Cookbook
- 機器人人工智能
- INSTANT VMware vCloud Starter
- ZigBee無線通信技術應用開發
- Mastering Predictive Analytics with scikit:learn and TensorFlow
- 步步驚“芯”
- 企業級Web開發實戰
- Hands-On Microservices with C#
- 網絡信息安全項目教程
- SolarWinds Server & Application Monitor:Deployment and Administration