- Hands-On Machine Learning with Microsoft Excel 2019
- Julio Cesar Rodriguez Martino
- 2021-06-24 15:11:02
Deciding whether to train outdoors depending on the weather
Let's suppose we have historical data on the decisions made by an experienced football trainer about whether to train her team outdoors (outside the gym), together with the weather conditions on the days when those decisions were made.
A typical dataset could look as follows:

The dataset was created specifically for this example and, of course, does not necessarily represent any real decisions.
In this example, the target variable is Train outside and the rest of the variables are the model features.
According to the data table, a possible decision tree would be as follows:

We choose to start splitting the data by the value of the Outlook feature. We can see that if the value is Overcast, then the decision to train outside is always Yes and does not depend on the values of the other features. Sunny and Rainy can be further split to get an answer.
How can we decide which feature to split on first, and how to continue from there? We will use entropy: for each candidate input feature, we measure how much the entropy of the target variable changes when the data is split by that feature's values.
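To make the idea concrete, here is a minimal Python sketch of the calculation, assuming the usual Shannon entropy in bits and the usual definition of the change in entropy (often called information gain): the entropy of the target minus the weighted average entropy of the groups produced by a split. The rows below are hypothetical and are not the dataset from the table; the `Windy` feature and the function names `entropy` and `information_gain` are also assumptions made for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, feature, target):
    """Entropy of the target minus the weighted entropy after splitting on `feature`."""
    base = entropy([row[target] for row in rows])
    # Group the target labels by the value of the chosen feature.
    groups = {}
    for row in rows:
        groups.setdefault(row[feature], []).append(row[target])
    # Weight each group's entropy by the fraction of rows it contains.
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder


# Hypothetical rows for illustration only (NOT the dataset from the book).
rows = [
    {"Outlook": "Sunny",    "Windy": "No",  "Train outside": "No"},
    {"Outlook": "Sunny",    "Windy": "Yes", "Train outside": "No"},
    {"Outlook": "Sunny",    "Windy": "No",  "Train outside": "Yes"},
    {"Outlook": "Overcast", "Windy": "No",  "Train outside": "Yes"},
    {"Outlook": "Overcast", "Windy": "Yes", "Train outside": "Yes"},
    {"Outlook": "Rainy",    "Windy": "No",  "Train outside": "Yes"},
    {"Outlook": "Rainy",    "Windy": "No",  "Train outside": "Yes"},
    {"Outlook": "Rainy",    "Windy": "Yes", "Train outside": "No"},
]

for feature in ("Outlook", "Windy"):
    gain = information_gain(rows, feature, "Train outside")
    print(f"{feature}: information gain = {gain:.4f}")
```

On these made-up rows, splitting on Outlook reduces the entropy more than splitting on Windy, so Outlook would be chosen as the first split. The same comparison is then repeated inside each branch that still contains mixed decisions.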