- The Data Science Workshop
- Anthony So Thomas V. Joseph Robert Thas John Andrew Worsley Dr. Samuel Asare
- 294字
- 2021-06-11 18:27:22
Introduction
In previous chapters, where an introduction to machine learning was covered, you were introduced to two broad categories of machine learning; supervised learning and unsupervised learning. Supervised learning can be further pided into two types of problem cases, regression and classification. In the last chapter, we covered regression problems. In this chapter, we will peek into the world of classification problems.
Take a look at the following Figure 3.1:

Figure 3.1: Overview of machine learning algorithms
Classification problems are the most prevalent use cases you will encounter in the real world. Unlike regression problems, where a real numbered value is predicted, classification problems deal with associating an example to a category. Classification use cases will take forms such as the following:
- Predicting whether a customer will buy the recommended product
- Identifying whether a credit transaction is fraudulent
- Determining whether a patient has a disease
- Analyzing images of animals and predicting whether the image is of a dog, cat, or panda
- Analyzing text reviews and capturing the underlying emotion such as happiness, anger, sorrow, or sarcasm
If you observe the preceding examples, there is a subtle difference between the first three and the last two. The first three revolve around binary decisions:
- Customers can either buy the product or not.
- Credit card transactions can be fraudulent or legitimate.
- Patients can be diagnosed as positive or negative for a disease.
Use cases that align with the preceding three genres where a binary decision is made are called binary classification problems. Unlike the first three, the last two associate an example with multiple classes or categories. Such problems are called multiclass classification problems. This chapter will deal with binary classification problems. Multiclass classification will be covered next in Chapter 4, Multiclass Classification with RandomForest.
- Bootstrap Site Blueprints Volume II
- 從零開始:數(shù)字圖像處理的編程基礎(chǔ)與應(yīng)用
- C語言程序設(shè)計案例教程(第2版)
- Learning Selenium Testing Tools with Python
- Java編程指南:基礎(chǔ)知識、類庫應(yīng)用及案例設(shè)計
- 實戰(zhàn)低代碼
- Visual Basic學(xué)習(xí)手冊
- ArcGIS By Example
- Visual C#.NET程序設(shè)計
- UML 基礎(chǔ)與 Rose 建模案例(第3版)
- R Data Analysis Cookbook(Second Edition)
- HTML5從入門到精通 (第2版)
- Python爬蟲、數(shù)據(jù)分析與可視化:工具詳解與案例實戰(zhàn)
- 軟件測試教程
- Go語言底層原理剖析