官术网_书友最值得收藏!

The Analytics Toolkit

There are several platforms today that are used for large-scale data analytics. At a broad level, these are pided into platforms that are used primarily for data mining, such as analysis of large datasets using NoSQL platforms, and those that are used for data science—that is, machine learning and predictive analytics. Oftentimes, the solution may have both the characteristics—a robust underlying platform for storing and managing data, and solutions that have been built on top of them that provide additional capabilities in data science.

In this chapter, we will show you how to install and configure your Analytics Toolkit, a collection of software that we'll use for the rest of the chapters:

  • Components of the Analytics Toolkit
  •  System recommendations
    • Installing on a laptop or workstation
    • Installing on the cloud
  • Installing Hadoop
    • Hadoop distributions
    • Cloudera Distribution of Hadoop (CDH)
  • Installing Spark
  • Installing R and Python
主站蜘蛛池模板: 久治县| 女性| 固阳县| 奈曼旗| 牙克石市| 会昌县| 依安县| 镶黄旗| 凤凰县| 上饶市| 咸丰县| 鹤庆县| 马尔康县| 承德市| 长沙市| 论坛| 临泉县| 出国| 邹城市| 本溪市| 内江市| 忻州市| 南漳县| 兴文县| 车致| 保靖县| 子长县| 绥宁县| 伊金霍洛旗| 丰宁| 河南省| 凤翔县| 淮南市| 周宁县| 德兴市| 长宁区| 明溪县| 苍山县| 法库县| 南昌县| 新郑市|