- Learning Data Mining with Python
- Robert Layton
- 117字
- 2021-07-16 13:30:51
Summary
In this chapter, we extended our use of scikit-learn's classifiers to perform classification and introduced the pandas library to manage our data. We analyzed real-world data on basketball results from the NBA, saw some of the problems that even well-curated data introduces, and created new features for our analysis.
We saw the effect that good features have on performance and used an ensemble algorithm, Random forests, to further improve the accuracy.
In the next chapter, we will extend the affinity analysis that we performed in the first chapter to create a program to find similar books. We will see how to use algorithms for ranking and also use approximation to improve the scalability of data mining.
推薦閱讀
- 零基礎(chǔ)學(xué)Visual C++第3版
- 程序設(shè)計(jì)與實(shí)踐(VB.NET)
- C語言程序設(shè)計(jì)習(xí)題解析與上機(jī)指導(dǎo)(第4版)
- C++面向?qū)ο蟪绦蛟O(shè)計(jì)(微課版)
- 深入淺出Android Jetpack
- ElasticSearch Cookbook(Second Edition)
- Creating Data Stories with Tableau Public
- Raspberry Pi Robotic Projects(Third Edition)
- Elasticsearch Essentials
- Visual Basic程序設(shè)計(jì)基礎(chǔ)
- 從零開始學(xué)Unity游戲開發(fā):場(chǎng)景+角色+腳本+交互+體驗(yàn)+效果+發(fā)布
- Pandas入門與實(shí)戰(zhàn)應(yīng)用:基于Python的數(shù)據(jù)分析與處理
- 計(jì)算機(jī)應(yīng)用基礎(chǔ)
- Tkinter GUI Application Development Blueprints
- Node.js核心技術(shù)教程