Introduction

Our approach to learning in this book is highly applied since hands-on learning is the quickest way to understand abstract concepts. With this in mind, the focus of this chapter is to introduce Jupyter Notebooks—the data science tool that we will be using throughout this book.

Since gaining mainstream popularity, Jupyter Notebooks have become one of the most important tools for data scientists who use Python. They offer a convenient environment for a variety of tasks, such as performing quick and dirty analysis, researching model selection, and creating reproducible pipelines. They allow data to be loaded, transformed, and modeled inside a single file, where it's quick and easy to test out code and explore ideas along the way. Furthermore, all of this can be documented inline using formatted text, which means you can make notes or even produce a structured report.
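As a rough illustration of loading, transforming, and modeling data in one place, the following minimal sketch could be run in a single Notebook cell or split across several. The dataset, the column names (hours, score), and the choice of a least-squares line are all assumptions made for this example and are not taken from the book's exercises.

```python
import csv
import io
from statistics import mean

# Hypothetical inline data standing in for a file on disk.
RAW_CSV = """hours,score
1,52
2,55
3,61
4,64
5,70
"""

# Load: parse the CSV text into a list of dictionaries.
rows = list(csv.DictReader(io.StringIO(RAW_CSV)))

# Transform: convert the string fields into numeric columns.
hours = [float(row["hours"]) for row in rows]
scores = [float(row["score"]) for row in rows]

# Model: fit a straight line by ordinary least squares.
x_bar, y_bar = mean(hours), mean(scores)
slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(hours, scores)) / sum(
    (x - x_bar) ** 2 for x in hours
)
intercept = y_bar - slope * x_bar

print(f"score = {slope:.2f} * hours + {intercept:.2f}")
```

In a Notebook, each of the three stages would typically sit in its own cell, so that a stage can be rerun or adjusted without repeating the ones before it.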

Other comparable platforms, such as RStudio or Spyder, offer multiple panels to work between. Frequently, one of these panels will be a Read-Eval-Print Loop (REPL), where code runs in a terminal session that keeps its variables in memory between commands. Code written there may end up being copied and pasted into a different panel that holds the main codebase, and there may also be additional panels for viewing visualizations or other files. Such development environments can be inefficient to work in and can promote bad practices for reproducibility if you're not careful.

Jupyter Notebooks work differently. Instead of having multiple panels for different components of your project, they offer the same functionality in a single component (that is, the Notebook), where text is displayed alongside code snippets and code outputs are displayed inline. This lets you code efficiently and allows you to look back at previous work for reference, or even to alter it.

We'll start this chapter by explaining exactly what Jupyter Notebooks are and why they are so popular among data scientists. Then, we'll access a Notebook together and go through some exercises to learn how the platform is used.