官术网_书友最值得收藏!

Chapter 1. First Steps to Scalability

Welcome to this book on scalable machine learning with Python.

In this chapter, we will discuss how to learn effectively from big data with Python and how it can be possible using your single machine or a cluster of other machines, which you can get, for instance, from Amazon Web Services (AWS) or the Google Cloud Platform.

In the book, we will be using Python's implementation of machine learning algorithms that are scalable. This means that they can work with a large amount of data and do not crash because of memory constraints. They also take a reasonable amount of time, which is something manageable for a data science prototype and also deployment in production. Chapters are organized around solutions (such as streaming data), algorithms (such as neural networks or ensemble of trees), and frameworks (such as Hadoop or Spark). We will also provide you with some basic reminders about the machine learning algorithms and explain how to make them scalable and suitable to problems with massive datasets.

Given such premises as a start, you'll need to learn the basics (so as to figure out the perspective under which this book has been written) and set up all your basic tools to start reading the chapters immediately.

In this chapter, we will introduce you to the following topics:

  • What scalability actually means
  • What bottlenecks you should pay attention to when dealing with data
  • What kind of problems this book will help you solve
  • How to use Python to analyze datasets at scale effectively
  • How to set up your machine quickly to execute the examples presented in this book

Let's start this journey together around scalable solutions with Python!

主站蜘蛛池模板: 黑水县| 天等县| 阿鲁科尔沁旗| 乐陵市| 东兴市| 莆田市| 宜君县| 额敏县| 漳浦县| 固原市| 思茅市| 长沙市| 深州市| 温宿县| 东乡族自治县| 建宁县| 和林格尔县| 临武县| 浪卡子县| 林州市| 循化| 朔州市| 乐安县| 甘泉县| 电白县| 汉源县| 大理市| 育儿| 洪雅县| 信丰县| 吉木乃县| 舒城县| 南宫市| 黄梅县| 台东市| 延安市| 北安市| 永新县| 武宁县| 青田县| 准格尔旗|