- Apache Hadoop 3 Quick Start Guide
- Hrishikesh Vijay Karambelkar
- 241字
- 2021-06-10 19:18:41
Choosing the right Hadoop distribution
We have seen the evolution of Hadoop from a simple lab experiment tool to one of the most famous projects of Apache Software Foundation in the previous section. When the evolution started, many commercial implementations of Hadoop spawned. Today, we see more than 10 different implementations that exist in the market (Source). There is a debate about whether to go with full open source-based Hadoop or with a commercial Hadoop implementation. Each approach has its pros and cons. Let's look at the open source approach.
Pros of open source-based Hadoop include the following:
- With a complete open source approach, you can take full advantage of community releases.
- It's easier and faster to reach customers due to software being free. It also reduces the initial cost of investment.
- Open source Hadoop supports open standards, making it easy to integrate with any system.
Cons of open source-based Hadoop include the following:
- In the complete open source Hadoop scenario, it takes longer to build implementations compared to commercial software, due to lack of handy tools that speed up implementation
- Supporting customers and fixing issues can become a tedious job due to the chaotic nature of the open source community
- The roadmap of the product cannot be controlled/ginfluenced based on business needs
Given these challenges, many times, companies prefer to go with commercial implementations of Apache Hadoop. We will cover some of the key Hadoop distributions in this section.
- 基于LabWindows/CVI的虛擬儀器設計與應用
- 2018西門子工業專家會議論文集(上)
- 機器學習流水線實戰
- 數據庫系統原理及應用教程(第5版)
- 基于32位ColdFire構建嵌入式系統
- MATLAB/Simulink權威指南:開發環境、程序設計、系統仿真與案例實戰
- Nginx高性能Web服務器詳解
- 運動控制系統應用與實踐
- Machine Learning with Apache Spark Quick Start Guide
- Azure PowerShell Quick Start Guide
- Applied Data Visualization with R and ggplot2
- MATLAB-Simulink系統仿真超級學習手冊
- 電腦上網入門
- 常用傳感器技術及應用(第2版)
- 機器人制作入門(第4版)