Introduction

The first chapter got you started with some basic Python and then equipped you with tools for data exploration. Specifically, we loaded the dataset, verified data integrity, and performed our first exploratory analysis of the case study dataset.

In this chapter, we finish our exploration of the data by examining the response variable. After we've concluded that the data is of high quality and makes sense, we will be ready to move forward with the practical concerns of developing machine learning models. We will take our first steps with scikit-learn, one of the most popular machine learning packages available in the Python language. Before learning the details of how mathematical models work in the next chapter, here we'll start to get comfortable with the syntax for using them in scikit-learn.

We will also learn some common techniques for answering the question, "Is this model good or not?" There are many possible ways to approach model evaluation. For business applications, some kind of financial analysis to determine the value that the model could create is usually necessary. However, we will reserve this for the end of the book.

There are several important model evaluation criteria that are considered standard knowledge in data science and machine learning. We will cover a few of the most widely used classification model performance metrics here, to give you a strong foundation.
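
As a small preview of the scikit-learn syntax and of one such metric, here is a minimal sketch. It assumes synthetic data generated with make_classification rather than the case study dataset, and the choice of logistic regression, the sample size, and the train/test split proportions are illustrative assumptions only, not the approach developed later in the book.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic binary classification data (an assumption for illustration;
# this is NOT the case study dataset).
X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# Hold out a quarter of the samples to evaluate the model on unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# The usual scikit-learn pattern: instantiate, fit, then predict.
model = LogisticRegression()
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

# Accuracy is the fraction of held-out samples classified correctly.
accuracy = accuracy_score(y_test, y_pred)
print(f"Test accuracy: {accuracy:.3f}")
```

The same instantiate, fit, and predict pattern applies to most scikit-learn estimators, which is why getting comfortable with the syntax first pays off before studying how the models work.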
