- Mastering Data Mining with Python:Find patterns hidden in your data
- Megan Squire
- 450字
- 2021-08-20 10:33:27
Chapter 1. Expanding Your Data Mining Toolbox
When faced with sensory information, human beings naturally want to find patterns to explain, differentiate, categorize, and predict. This process of looking for patterns all around us is a fundamental human activity, and the human brain is quite good at it. With this skill, our ancient ancestors became better at hunting, gathering, cooking, and organizing. It is no wonder that pattern recognition and pattern prediction were some of the first tasks humans set out to computerize, and this desire continues in earnest today. Depending on the goals of a given project, finding patterns in data using computers nowadays involves database systems, artificial intelligence, statistics, information retrieval, computer vision, and any number of other various subfields of computer science, information systems, mathematics, or business, just to name a few. No matter what we call this activity – knowledge discovery in databases, data mining, data science – its primary mission is always to find interesting patterns.
Despite this humble-sounding mission, data mining has existed for long enough and has built up enough variation in how it is implemented that it has now become a large and complicated field to master. We can think of a cooking school, where every beginner chef is first taught how to boil water and how to use a knife before moving to more advanced skills, such as making puff pastry or deboning a raw chicken. In data mining, we also have common techniques that even the newest data miners will learn: How to build a classifier and how to find clusters in data. The title of this book, however, is Mastering Data Mining with Python, and so, as a mastering-level book, the aim is to teach you some of the techniques you may not have seen in earlier data mining projects.
In this first chapter, we will cover the following topics:
- What is data mining? We will situate data mining in the growing field of other similar concepts, and we will learn a bit about the history of how this discipline has grown and changed.
- How do we do data mining? Here, we compare several processes or methodologies commonly used in data mining projects.
- What are the techniques used in data mining? In this section, we will summarize each of the data analysis techniques that are typically included in a definition of data mining, and we will highlight the more exotic or underappreciated techniques that we will be covering in this mastering-level book.
- How do we set up a data mining work environment? Finally, we will walk through setting up a Python-based development environment that we will use to complete the projects in the rest of this book.
- .NET之美:.NET關鍵技術深入解析
- Visual Studio 2012 Cookbook
- 自己動手寫Java虛擬機
- PyQt從入門到精通
- Programming ArcGIS 10.1 with Python Cookbook
- Java虛擬機字節碼:從入門到實戰
- Lua程序設計(第4版)
- Reactive Android Programming
- Java EE 7 Performance Tuning and Optimization
- Regression Analysis with Python
- Cocos2d-x Game Development Blueprints
- Machine Learning for Developers
- Flink技術內幕:架構設計與實現原理
- 算法設計與分析:基于C++編程語言的描述
- HTML5移動前端開發基礎與實戰(微課版)