- Learning Apache Cassandra(Second Edition)
- Sandeep Yarabarla
- 2021-07-03 00:19:22
How to handle big data
Now that we are convinced the relational model is not a good fit for big data, let's try to figure out ways to handle big data. These are the solutions that paved the way for various NoSQL databases:
- Clustering: Data should be spread across the nodes of a cluster, so that each node holds a different subset of the data, and each piece of data should be replicated on multiple nodes so that the cluster can sustain node failures. Distributing the data improves performance, and replicating it provides fault tolerance.
A node is an instance of database software running on a server. Multiple instances of the same database could be running on the same server.
- Flexible schema: Unlike in the relational model, schemas should be flexible and should evolve with the data.
- Relax consistency: We should embrace the concept of eventual consistency, which means that, when data is replicated, a write will eventually be propagated to all the nodes in the cluster that hold a copy of it. Eventual consistency allows data to be replicated across nodes with minimal overhead. This allows for fast writes without the need for distributed locking.
- Denormalization of data: Denormalize data to optimize queries. This has to be done at the cost of writing and maintaining multiple copies of the same data.
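To make the clustering idea above concrete, here is a minimal sketch in Python. It is an illustration only, not how Cassandra or any particular database places data: the names `replicas_for` and `ToyCluster` are hypothetical, placement uses a simple hash modulo the node count, and "nodes" are just in-memory dictionaries. It shows each key being stored on a subset of nodes and remaining readable while at least one of its replicas is up.

```python
import hashlib


def replicas_for(key, nodes, replication_factor):
    """Pick the nodes that hold `key` (hypothetical helper).

    The first replica is chosen by hashing the key; the remaining
    replicas are the next nodes in list order, wrapping around.
    """
    if not 1 <= replication_factor <= len(nodes):
        raise ValueError("replication_factor must be between 1 and len(nodes)")
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    start = int(digest, 16) % len(nodes)
    return [nodes[(start + i) % len(nodes)] for i in range(replication_factor)]


class ToyCluster:
    """In-memory stand-in for a cluster; each node is a plain dict."""

    def __init__(self, node_names, replication_factor=2):
        self.nodes = list(node_names)
        self.replication_factor = replication_factor
        self.storage = {name: {} for name in self.nodes}
        self.down = set()  # names of nodes currently considered failed

    def write(self, key, value):
        # Store the value on every replica that is currently up.
        for node in replicas_for(key, self.nodes, self.replication_factor):
            if node not in self.down:
                self.storage[node][key] = value

    def read(self, key):
        # Return the value from the first live replica that has it.
        for node in replicas_for(key, self.nodes, self.replication_factor):
            if node not in self.down and key in self.storage[node]:
                return self.storage[node][key]
        return None
```

With three nodes and a replication factor of 2, each key lives on two of the three nodes. Adding one of a key's replicas to `cluster.down` still leaves `cluster.read(key)` able to answer from the other replica, which is the fault tolerance the clustering bullet describes. The sketch does not repair a replica that missed writes while it was down, so it says nothing about how eventual consistency is actually achieved.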