官术网_书友最值得收藏!

What you need for this book

To complete the projects in this book, you will need a version of Python 3.5 or higher. I recommend using Anaconda Python, but any Python distribution will do as long as it is updated and contains the following packages: Numpy, Matplotlib, NetworkX, PyMySQL, Gensim, and NLTK. In Chapter 1, Expanding Your Data Mining Toolbox, we will walk through an easy installation of Python and all these libraries, and each time a library is used later in the book, we will install it or upgrade it together.

Because data mining is obviously data-centric, and because the data sets we are working with are sometimes large or require some type of persistent data storage, I chose to implement some of the data mining algorithms alongside a relational database system. I chose MySQL for accomplishing this since it is an established, easy-to-download and install piece of infrastructure. The chapters where MySQL comes into play are in working with the memory-intensive algorithms in Chapter 2, Association Rule Mining, and Chapter 3, Entity Matching. I also use MySQL for some of the examples in Chapter 9, Mining for Data Anomalies, but it is possible to go through that chapter without MySQL.

主站蜘蛛池模板: 彩票| 仪陇县| 东阳市| 司法| 烟台市| 集贤县| 苏尼特右旗| 宿迁市| 霍城县| 台安县| 铁岭县| 阿拉尔市| 赤壁市| 新龙县| 南昌县| 根河市| 驻马店市| 监利县| 奇台县| 明水县| 华坪县| 齐河县| 博乐市| 葫芦岛市| 桑植县| 东乌珠穆沁旗| 安仁县| 大邑县| 玛多县| 富锦市| 佛山市| 鄂州市| 通海县| 清水县| 原平市| 榆树市| 视频| 天镇县| 沙田区| 临武县| 武城县|