官术网_书友最值得收藏!

Getting Started with Data Mining

We are collecting information about our world on a scale that has never been seen before in the history of humanity. Along with this trend, we are now placing more day-to-day importance on the use of this information in everyday life. We now expect our computers to translate web pages into other languages, predict the weather with high accuracy, suggest books we would like, and to diagnose our health issues. These expectations will grow into the future, both in application breadth and efficacy. Data Mining is a methodology that we can employ to train computers to make decisions with data and forms the backbone of many high-tech systems of today.

The Python programming language is fast growing in popularity, for a good reason. It gives the programmer flexibility, it has many modules to perform different tasks, and Python code is usually more readable and concise than in any other languages. There is a large and an active community of researchers, practitioners, and beginners using Python for data mining.

In this chapter, we will introduce data mining with Python. We will cover the following topics

  • What is data mining and where can we use it?
  • Setting up a Python-based environment to perform data mining
  • An example of affinity analysis, recommending products based on purchasing habits
  • An example of (a classic) classification problem, predicting the plant species based on its measurement
主站蜘蛛池模板: 株洲市| 玉树县| 禄丰县| 达拉特旗| 南皮县| 四会市| 晴隆县| 桃园市| 鄂托克旗| 兴化市| 恩平市| 英德市| 渑池县| 全椒县| 南木林县| 盐亭县| 开封市| 大同县| 宁都县| 拜城县| 丽水市| 福鼎市| 贵南县| 蕲春县| 伊金霍洛旗| 铁岭市| 大邑县| 西藏| 三河市| 彭泽县| 余姚市| 蓝山县| 名山县| 泰兴市| 三明市| 土默特左旗| 台中县| 青川县| 壶关县| 红原县| 凤翔县|