
Introduction

In this book, we will cover various ensemble techniques and learn how to combine multiple machine learning algorithms to enhance a model's performance. Throughout the book, we will use pandas, NumPy, scikit-learn, and Matplotlib, all of which were built for working with Python. By now, you should be familiar with data manipulation and exploration.

In this chapter, we will recap how to read and manipulate data in Python, how to analyze and treat missing values, and how to explore data to gain deeper insights. We will use Python packages such as numpy and pandas for data manipulation and exploration, and seaborn for data visualization. We will continue to use some or all of these libraries in the later chapters of this book as well. We will also use the Anaconda distribution for our Python coding. If you have not installed Anaconda, you need to download it from https://www.anaconda.com/download. At the time of writing this book, the latest version of Anaconda is 5.2, and it comes with both Python 3.6 and Python 2.7. We suggest you download Anaconda for Python 3.6. We will also use the HousePrices dataset, which is available on GitHub.
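As a rough preview of the kind of work this chapter recaps, the following minimal sketch counts and fills missing values with pandas. It does not load the actual HousePrices file: the small DataFrame, its column names (`LotArea`, `SalePrice`, `Street`), and its values are invented stand-ins for illustration, and the median/mode imputation shown is just one common choice, not necessarily the treatment used later in the chapter.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for a housing dataset; columns and values are
# invented for illustration and are not taken from the HousePrices file.
df = pd.DataFrame({
    "LotArea": [8450, 9600, np.nan, 9550],
    "SalePrice": [208500, 181500, 223500, np.nan],
    "Street": ["Pave", "Pave", None, "Grvl"],
})

# Count the missing values in each column.
missing_counts = df.isnull().sum()
print(missing_counts)

# Work on a copy so the original data stays untouched.
df_filled = df.copy()

# Fill numeric gaps with the column median (one common, simple choice).
for col in ["LotArea", "SalePrice"]:
    df_filled[col] = df_filled[col].fillna(df_filled[col].median())

# Fill the categorical gap with the most frequent value (the mode).
df_filled["Street"] = df_filled["Street"].fillna(df_filled["Street"].mode()[0])

print(df_filled)
```

With a real file, the DataFrame would typically come from `pd.read_csv` on the downloaded dataset instead of being built by hand.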
