官术网_书友最值得收藏!

How to handle big data

Now that we are convinced the relational model is not a good fit for big data, let's try to figure out ways to handle big data. These are the solutions that paved the way for various NoSQL databases:

  • Clustering: The data should be spread across different nodes in a cluster. The data should be replicated across multiple nodes in order to sustain node failures. This helps spread the data across the cluster, and different nodes contain different subsets of data. This improves performance and provides fault tolerance.

A node is an instance of database software running on a server. Multiple instances of the same database could be running on the same server.

  • Flexible schema: Schemas should be flexible unlike the relational model and should evolve with the data.
  • Relax consistency: We should embrace the concept of eventual consistency, which means data will eventually be propagated to all the nodes in the cluster (in case of replication). Eventual consistency allows data replication across nodes with minimum overhead. This allows for fast writes with the need for distributed locking.
  • Denormalization of data: Denormalize data to optimize queries. This has to be done at the cost of writing and maintaining multiple copies of the same data.
主站蜘蛛池模板: 晋州市| 尼玛县| 襄城县| 徐闻县| 文化| 泰宁县| 京山县| 通河县| 油尖旺区| 乐业县| 安多县| 肃宁县| 达日县| 上犹县| 开江县| 海兴县| 江安县| 宜黄县| 宽甸| 咸宁市| 调兵山市| 黄大仙区| 明水县| 会同县| 吕梁市| 卢氏县| 凭祥市| 封丘县| 乌鲁木齐县| 黄陵县| 陕西省| 伊金霍洛旗| 南召县| 理塘县| 诸暨市| 理塘县| 邻水| 噶尔县| 保亭| 唐山市| 富锦市|