Introduction

In this book, we will cover various ensemble techniques and learn how to combine multiple machine learning algorithms to enhance a model's performance. We will use pandas, NumPy, scikit-learn, and Matplotlib, all of which are Python libraries, throughout the book. By now, you should be familiar with data manipulation and exploration.

In this chapter, we will recap how to read and manipulate data in Python, how to analyze and treat missing values, and how to explore data to gain deeper insights. We will use Python packages such as NumPy and pandas for data manipulation and exploration, and seaborn for data visualization. We will continue to use some or all of these libraries in the later chapters of this book. We will also use the Anaconda distribution for our Python coding. If you have not installed Anaconda, download it from https://www.anaconda.com/download. At the time of writing, the latest version of Anaconda is 5.2, which is available for both Python 3.6 and Python 2.7. We suggest you download Anaconda for Python 3.6. We will also use the HousePrices dataset, which is available on GitHub.
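
As a quick preview of the workflow recapped in this chapter, the following minimal sketch reads a small table with pandas, counts the missing values, and fills them. The inline data and its column names (LotArea, Neighborhood, SalePrice) are hypothetical stand-ins chosen for illustration, not the actual HousePrices file, and median or constant imputation is only one possible treatment.

```python
import io

import pandas as pd

# Hypothetical stand-in data; column names and values are illustrative only
csv_text = """LotArea,Neighborhood,SalePrice
8450,CollgCr,208500
9600,,181500
,Crawfor,223500
9550,Crawfor,140000
"""

# Read the data (a file path could be passed to read_csv instead)
df = pd.read_csv(io.StringIO(csv_text))

# Analyze missing values: count of nulls per column
missing_counts = df.isnull().sum()
print(missing_counts)

# Treat missing values: median for the numeric column,
# a placeholder label for the categorical column
df_filled = df.copy()
df_filled["LotArea"] = df_filled["LotArea"].fillna(df_filled["LotArea"].median())
df_filled["Neighborhood"] = df_filled["Neighborhood"].fillna("Unknown")

# Explore: summary statistics of the numeric columns
summary = df_filled.describe()
print(summary)
```

Working on a copy keeps the original DataFrame untouched, which makes it easier to compare different imputation choices later.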
