官术网_书友最值得收藏!

Chapter 2. Manipulating Data with Breeze

Data science is, by and large, concerned with the manipulation of structured data. A large fraction of structured datasets can be viewed as tabular data: each row represents a particular instance, and columns represent different attributes of that instance. The ubiquity of tabular representations explains the success of spreadsheet programs like Microsoft Excel, or of tools like SQL databases.

To be useful to data scientists, a language must support the manipulation of columns or tables of data. Python does this through NumPy and pandas, for instance. Unfortunately, there is no single, coherent ecosystem for numerical computing in Scala that quite measures up to the SciPy ecosystem in Python.

In this chapter, we will introduce Breeze, a library for fast linear algebra and manipulation of data arrays as well as many other features necessary for scientific computing and data science.

主站蜘蛛池模板: 石首市| 汉川市| 崇信县| 武汉市| 中江县| 奉节县| 锦屏县| 潜江市| 永年县| 南丹县| 巴林左旗| 壤塘县| 章丘市| 乐亭县| 太仓市| 河源市| 浑源县| 绍兴县| 西安市| 墨脱县| 金山区| 都江堰市| 淮南市| 弥渡县| 法库县| 肇庆市| 成都市| 贵州省| 攀枝花市| 遂宁市| 大方县| 茶陵县| 璧山县| 崇文区| 海宁市| 文登市| 闸北区| 通道| 邵阳县| 封丘县| 江都市|