官术网_书友最值得收藏!

Chapter 2. Manipulating Data with Breeze

Data science is, by and large, concerned with the manipulation of structured data. A large fraction of structured datasets can be viewed as tabular data: each row represents a particular instance, and columns represent different attributes of that instance. The ubiquity of tabular representations explains the success of spreadsheet programs like Microsoft Excel, or of tools like SQL databases.

To be useful to data scientists, a language must support the manipulation of columns or tables of data. Python does this through NumPy and pandas, for instance. Unfortunately, there is no single, coherent ecosystem for numerical computing in Scala that quite measures up to the SciPy ecosystem in Python.

In this chapter, we will introduce Breeze, a library for fast linear algebra and manipulation of data arrays as well as many other features necessary for scientific computing and data science.

主站蜘蛛池模板: 巨野县| 布拖县| 玉溪市| 宁武县| 星座| 溆浦县| 无极县| 青神县| 陆丰市| 河北区| 黄石市| 南涧| 修水县| 仁布县| 芦溪县| 呼玛县| 会昌县| 枣庄市| 洛浦县| 凤城市| 偏关县| 怀安县| 乐清市| 杭州市| 仙桃市| 商南县| 大姚县| 黄石市| 诏安县| 绥化市| 环江| 皮山县| 鹿邑县| 姜堰市| 壤塘县| 惠安县| 涿州市| 高密市| 遵化市| 岑溪市| 瑞昌市|