
Summary

In this chapter, we introduced data mining using Python. If you were able to run the code in this section (note that the full code is available in the supplied code package), then your computer is set up for much of the rest of the book. Other Python libraries will be introduced in later chapters to perform more specialized tasks.

We used the IPython Notebook to run our code, which allows us to immediately view the results of a small section of the code. This is a useful framework that will be used throughout the book.

We introduced a simple affinity analysis, finding products that are purchased together. This type of exploratory analysis gives an insight into a business process, an environment, or a scenario. The information from these types of analysis can assist in business processes, finding the next big medical breakthrough, or creating the next artificial intelligence.
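As a reminder of what this kind of analysis computes, here is a minimal sketch that scores rules of the form "if a customer buys X, they also buy Y". The toy baskets, the `rule_stats` helper, and the choice to report support as a raw count and confidence as a fraction are all assumptions made for illustration, not the chapter's exact code.

```python
from collections import defaultdict
from itertools import permutations

# Hypothetical toy transactions; each set holds the items in one basket.
transactions = [
    {"bread", "milk"},
    {"bread", "cheese"},
    {"bread", "milk", "apples"},
    {"milk", "apples"},
    {"bread", "milk", "cheese"},
]


def rule_stats(transactions):
    """Return {(premise, conclusion): (support, confidence)} for one-item rules."""
    premise_counts = defaultdict(int)  # baskets that contain the premise
    rule_counts = defaultdict(int)     # baskets that contain premise and conclusion
    for basket in transactions:
        for item in basket:
            premise_counts[item] += 1
        # Every ordered pair of distinct items in the basket supports one rule.
        for premise, conclusion in permutations(basket, 2):
            rule_counts[(premise, conclusion)] += 1
    return {
        rule: (count, count / premise_counts[rule[0]])
        for rule, count in rule_counts.items()
    }


stats = rule_stats(transactions)
support, confidence = stats[("bread", "milk")]
print(f"bread -> milk: support={support}, confidence={confidence:.2f}")
```

Sorting `stats` by support or by confidence gives a ranked list of candidate rules to inspect.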

Also in this chapter, we worked through a simple classification example using the OneR algorithm. OneR selects the single feature whose values best predict the class and, for each value of that feature, predicts the class that occurred most frequently with that value in the training dataset.
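The OneR idea can be sketched in a few lines of plain Python. This is an illustrative version under stated assumptions: the `train_one_r` function, the toy data, and the use of training error count to pick the "best" feature are hypothetical choices for this sketch, and features are assumed to be discrete already.

```python
from collections import Counter, defaultdict


def train_one_r(X, y):
    """Return (feature_index, {value: predicted_class}) with the fewest training errors."""
    best = None
    for feature in range(len(X[0])):
        # Count how often each class occurs with each value of this feature.
        class_counts = defaultdict(Counter)
        for row, label in zip(X, y):
            class_counts[row[feature]][label] += 1
        # For each value, predict the class seen most often with it.
        rule = {
            value: counts.most_common(1)[0][0]
            for value, counts in class_counts.items()
        }
        # Errors are the samples that do not belong to the predicted class.
        errors = sum(
            sum(counts.values()) - max(counts.values())
            for counts in class_counts.values()
        )
        if best is None or errors < best[0]:
            best = (errors, feature, rule)
    _, feature, rule = best
    return feature, rule


# Hypothetical toy data: two binary features, binary class labels.
X = [[0, 1], [0, 0], [1, 1], [1, 0], [1, 1], [0, 0]]
y = [0, 0, 1, 1, 1, 0]

feature, rule = train_one_r(X, y)
predictions = [rule[row[feature]] for row in X]
print(feature, rule, predictions)
```

A value that never appeared during training has no entry in `rule`, so a fuller version would need a fallback such as the overall majority class.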

Over the next few chapters, we will expand on the concepts of classification and affinity analysis. We will also introduce the scikit-learn package and the algorithms it includes.
