
Chapter 2. Manipulating Data with Breeze

Data science is, by and large, concerned with the manipulation of structured data. A large fraction of structured datasets can be viewed as tabular data: each row represents a particular instance, and each column represents a different attribute of that instance. The ubiquity of tabular representations helps explain the success of spreadsheet programs like Microsoft Excel and of tools like SQL databases.

To be useful to data scientists, a language must support the manipulation of columns or tables of data. Python does this through NumPy and pandas, for instance. Unfortunately, there is no single, coherent ecosystem for numerical computing in Scala that quite measures up to the SciPy ecosystem in Python.

In this chapter, we will introduce Breeze, a library for fast linear algebra and the manipulation of data arrays. Breeze also provides many other features necessary for scientific computing and data science.
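
As a small preview of what this looks like in practice, the sketch below stores a tiny table in a Breeze `DenseMatrix` (rows as instances, columns as attributes) and applies two basic operations to it. The object name `BreezePreview`, the values, and the `weights` vector are illustrative assumptions, not taken from the chapter, and the snippet assumes Breeze is already on the classpath.

```scala
// Assumes the Breeze dependency (group "org.scalanlp", artifact "breeze")
// has been added to the build; the version is left to the reader.
import breeze.linalg._

object BreezePreview extends App {
  // A 3 x 2 table: each row is an instance, each column an attribute.
  // The numbers are made up for illustration.
  val data = DenseMatrix(
    (1.0, 2.0),
    (3.0, 4.0),
    (5.0, 6.0)
  )

  // One hypothetical weight per attribute (column).
  val weights = DenseVector(0.5, 2.0)

  // Matrix-vector product: yields one weighted total per row.
  val scores = data * weights

  // Slice out the first column as a vector and add up its entries.
  val firstColumn = data(::, 0)
  val firstColumnTotal = sum(firstColumn)

  println(scores)
  println(firstColumnTotal)
}
```

Both operations work on whole rows and columns at once rather than through explicit loops, which is the style of array manipulation that NumPy users will recognize.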
