官术网_书友最值得收藏!

Defining Python

Python is one of the most common programming languages among data scientists, along with R. The main advantage of Python is its flexibility and simplicity. It makes the data analysis and manipulation easy by offering a lot of packages. It shows great performance in analyzing unstructured textual data and has a very good ecosystem of tools and packages for this purpose.

For the purposes of the book, we have chosen Python 3.5.2. It is the most up-to-date version, which implements many improvements compared to Python 2.7. The main advantage in text analysis is an automatic management of Unicode variables. Python 2.7 is still widely used by programmers and data scientists due to a big choice of external libraries, documentation, and online resources. However, the new version has already reached a sufficient level of compatibility with packages, and on top of it, offers multiple new features.

We will use the pip command tool for installation of all libraries and dependencies.

主站蜘蛛池模板: 嵊泗县| 中方县| 胶南市| 温州市| 临颍县| 肇东市| 景谷| 本溪市| 偏关县| 陕西省| 紫阳县| 迁西县| 三穗县| 开原市| 巴塘县| 安溪县| 麟游县| 婺源县| 疏勒县| 奉节县| 淅川县| 凤翔县| 建阳市| 大名县| 临泉县| 寿宁县| 新兴县| 瑞丽市| 五常市| 岳阳市| 扎鲁特旗| 云梦县| 东辽县| 阿拉善盟| 梨树县| 山西省| 北宁市| 卓尼县| 扶绥县| 黎川县| 工布江达县|