官术网_书友最值得收藏!

Handling big data

One massive problem faced by legacy RDBMS systems is difficulty managing Big Data (https://en.wikipedia.org/wiki/Big_data). Examples would include data produced by the NASA Center for Climate Change, the Human Genome Project, which analyzes strands of DNA, or the Sloan Digital Sky Survey, which collects astronomical data. RDBMS systems are designed to maximize storage, which was an expensive resource 50 years ago. In the 21st century, storage costs have dropped dramatically, making this a secondary consideration. Another aspect of RDBMS systems is their ability to provide flexibility by way of creating relations between tables, which by its very nature introduces overheads, compounded when handling big data.

MongoDB addresses the needs of big data by incorporating modern algorithms such as map reduce (https://en.wikipedia.org/wiki/MapReduce), which allows for parallel distributed processing on a cluster of servers. In addition, MongoDB has a feature referred to as sharding, which allows fragments of a database to be stored and processed on multiple servers. 

It should be noted that although MongoDB is designed to handle big data, it is actually more of a general purpose platform. If your only need is to handle big data, it might be worth your while to investigate Apache Cassandra ( https://cassandra.apache.org/) with Hadoop ( http://hadoop.apache.org/), which is expressly designed to handle massive amounts of data.
主站蜘蛛池模板: 凤阳县| 十堰市| 无极县| 新巴尔虎右旗| 资兴市| 德格县| 噶尔县| 营山县| 万宁市| 诸城市| 柳林县| 府谷县| 屯昌县| 林甸县| 静宁县| 江安县| 宜昌市| 太仆寺旗| 霍林郭勒市| 福泉市| 建湖县| 安乡县| 皮山县| 南充市| 时尚| 县级市| 尤溪县| 万山特区| 香河县| 姜堰市| 安化县| 抚州市| 泸溪县| 天等县| 临夏市| 阿合奇县| 鸡泽县| 紫金县| 营山县| 东乡县| 财经|