- The Data Science Workshop
- Anthony So, Thomas V. Joseph, Robert Thas John, Andrew Worsley, Dr. Samuel Asare
- 2021-06-11 18:27:24
Introduction
In the previous chapter, you saw how to build a binary classifier using the well-known logistic regression algorithm. In binary classification, the response variable can take only two different values, such as 0 and 1 or yes and no. Multiclass classification is an extension of this task: its response variable can take more than two different values.
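As a rough illustration of this distinction, the number of distinct values in the response variable can be counted. The helper name `classification_type` and the sample labels below are hypothetical, introduced only for this sketch.

```python
def classification_type(target_values):
    """Label a task as binary or multiclass from its distinct target values."""
    n_classes = len(set(target_values))
    if n_classes < 2:
        raise ValueError("a classification target needs at least two distinct values")
    return "binary" if n_classes == 2 else "multiclass"


# Hypothetical response variables, for illustration only
binary_target = ["yes", "no", "no", "yes"]
rating_target = ["Like it", "Neutral", "Hate it", "Love it", "Like it"]

print(classification_type(binary_target))   # two distinct values
print(classification_type(rating_target))   # more than two distinct values
```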
In the data science industry, you will often face multiclass classification problems. For example, if you were working for Netflix or any other streaming platform, you might have to build a model that predicts the user rating for a movie based on key attributes such as genre, duration, or cast. A potential list of rating values might be: Hate it, Dislike it, Neutral, Like it, Love it. The objective of the model would be to predict the right rating from those five possible values.
Multiclass classification doesn't always mean the response variable will be text. In some datasets, the target variable may be encoded in numerical form. In the same example, the rating may be coded from 1 to 5: 1 for Hate it, 2 for Dislike it, 3 for Neutral, and so on. These numbers are labels for categories, not measured quantities, so it is important to understand the meaning of the response variable before concluding that the task is a regression problem.
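The coding described above can be sketched as a simple lookup. The mapping follows the rating example in the text; the names `RATING_CODES`, `encode`, and `decode` are assumptions made for this illustration, and a real project would often rely on a library encoder instead.

```python
# Codes follow the example in the text: 1 for Hate it up to 5 for Love it
RATING_CODES = {
    "Hate it": 1,
    "Dislike it": 2,
    "Neutral": 3,
    "Like it": 4,
    "Love it": 5,
}
# Reverse lookup so numeric codes can be turned back into readable labels
CODE_TO_RATING = {code: label for label, code in RATING_CODES.items()}


def encode(labels):
    """Convert text ratings into their numeric codes."""
    return [RATING_CODES[label] for label in labels]


def decode(codes):
    """Convert numeric codes back into text ratings."""
    return [CODE_TO_RATING[code] for code in codes]


ratings = ["Like it", "Hate it", "Neutral"]  # hypothetical sample
codes = encode(ratings)
print(codes)          # [4, 1, 3]
print(decode(codes))  # ['Like it', 'Hate it', 'Neutral']
```

Although the encoded target looks numeric, it takes only five distinct values that each stand for a category, which suggests that a classifier rather than a regressor is the appropriate model.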
In the next section, we will be looking at training our first Random Forest classifier.