- Hands-On Machine Learning with Microsoft Excel 2019
- Julio Cesar Rodriguez Martino
- 424字
- 2021-06-24 15:11:02
Understanding supervised learning with decision trees
The decision tree algorithm uses a tree-like model of decisions. Its name is derived from the graphical representation of the cascading process that partitions the records. The algorithm chooses the input variables that better split the dataset into subsets that are more pure in terms of the target variable, ideally a subset that contains only one value of this variable. Decision trees are some of the most widely used and easy to understand classification algorithms.
The outcome of the tree algorithm calculation is a set of simple rules that explain which values or intervals of the input values split the original data better. The fact that the results and the path followed to get to them can be clearly shown gives decision trees an advantage over other algorithms. Explainability is a serious problem for some machine learning and artificial intelligence systems – which are mostly used as black boxes – and is a study subject in itself.
In complex problems, we need to decide when to stop the tree development. A large number of features can lead to a very large and complex tree, so the number of branches and the length of the tree are usually limited by the user.
Entropy is a very important concept in decision trees and the way of quantifying the purity of each subsample. It measures the amount of information contained in each leaf of the tree. The lower the entropy, the larger the amount of information. Zero entropy means that a subset contains only one value of the target variable, while a value of one represents a subset that contains the same amount of both values. This concept will be explained later with examples.
Using the entropy that's calculated in every step, the algorithm chooses the best variable to split the data and recursively repeats the same procedure. The user can decide how to stop the calculation, either when all subsets have an entropy of zero, when there are no more features to split by, or a minimum entropy level.
The input features that are best suited for use in a decision tree are the categorical ones. In case of a continuous, numerical variable, it should be first converted into categories by dividing it into ranges; for example, A > 0.5 would be A1 and A ≤ 0.5 would be A2.
Let's look at an example that explains the concept of the decision tree algorithm.
- Learning Spring Boot
- Creating Dynamic UIs with Android Fragments(Second Edition)
- Sybase數據庫在UNIX、Windows上的實施和管理
- 企業級數據與AI項目成功之道
- 數據庫設計與應用(SQL Server 2014)(第二版)
- 大數據架構商業之路:從業務需求到技術方案
- 云計算寶典:技術與實踐
- 大數據測試技術:數據采集、分析與測試實踐(在線實驗+在線自測)
- 數字化轉型實踐:構建云原生大數據平臺
- Learning Ansible
- Arquillian Testing Guide
- 社交網站的數據挖掘與分析(原書第2版)
- 數字孿生
- MySQL 8.0從入門到實戰
- 達夢數據庫集群