Defining Python

Along with R, Python is one of the most widely used programming languages among data scientists. Its main advantages are flexibility and simplicity. A large number of packages make data analysis and manipulation easy. It performs well in analyzing unstructured textual data and has a rich ecosystem of tools and packages for this purpose.

For the purposes of the book, we have chosen Python 3.5.2. It is the most up-to-date version at the time of writing and implements many improvements over Python 2.7. The main advantage for text analysis is that strings are Unicode by default, so non-ASCII text is handled automatically. Python 2.7 is still widely used by programmers and data scientists because of its large choice of external libraries, documentation, and online resources. However, Python 3 has already reached a sufficient level of package compatibility and, on top of that, offers multiple new features.
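
To illustrate the Unicode point, here is a minimal sketch. The sample string is an arbitrary assumption, not taken from the book. It shows that in Python 3 a plain string is already a sequence of Unicode characters, while conversion to bytes is an explicit step:

```python
# In Python 3, str holds Unicode code points; bytes is a separate type.
# The sample text is an arbitrary illustration (assumption): "naïve café",
# written with escapes so the accented letters are single code points.
text = "na\u00efve caf\u00e9"

# len() counts characters, not bytes.
char_count = len(text)

# Encoding to UTF-8 is explicit and yields a bytes object.
encoded = text.encode("utf-8")
byte_count = len(encoded)

# Decoding restores the original string.
decoded = encoded.decode("utf-8")

print(char_count, byte_count, decoded == text)
```

In Python 2.7 the same literal would be a byte string unless prefixed with `u`, which is a common source of encoding errors when processing text.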

We will use the pip command-line tool to install all libraries and dependencies.
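
As a rough sketch of what a pip installation looks like, the snippet below builds the command that installs a package for the interpreter currently running the script. The helper `pip_install_command` and the package name `some-package` are hypothetical placeholders, not names used by the book:

```python
import sys


def pip_install_command(package):
    """Return the argument list that installs `package` with pip
    for the interpreter currently running this script."""
    # Running pip as a module ties the install to this exact interpreter.
    return [sys.executable, "-m", "pip", "install", package]


# "some-package" is a placeholder, not a library the book names.
command = pip_install_command("some-package")
print(" ".join(command))
```

Passing the resulting list to `subprocess.check_call` would run the installation. Typing `pip install some-package` in a terminal has the same effect when `pip` belongs to the intended Python installation.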