- Hands-On Machine Learning with Microsoft Excel 2019
- Julio Cesar Rodriguez Martino
- 196字
- 2021-06-24 15:11:02
Deciding whether to train outdoors depending on the weather
Let's suppose we have historical data on the decisions made by an experienced football trainer about training outdoors (outside the gym) or not with her team, including the weather conditions on the days when the decisions were made.
A typical dataset could look as follows:

The dataset was specifically created for this example and, of course, might not represent any real decisions.
In this example, the target variable is Train outside and the rest of the variables are the model features.
According to the data table, a possible decision tree would be as follows:

We choose to start splitting the data by the value of the Outlook feature. We can see that if the value is Overcast, then the decision to train outside is always Yes and does not depend on the values of the other features. Sunny and Rainy can be further split to get an answer.
How can we decide which feature to use first and how to continue? We will use the value of the entropy, measuring how much its value changes when considering different input features.
- GitHub Essentials
- Python數據挖掘:入門、進階與實用案例分析
- Word 2010中文版完全自學手冊
- Modern Programming: Object Oriented Programming and Best Practices
- 大數據:規劃、實施、運維
- 大數據Hadoop 3.X分布式處理實戰
- 數據庫技術及應用教程
- 從0到1:JavaScript 快速上手
- 數據庫技術實用教程
- 數據科學實戰指南
- 一本書講透Elasticsearch:原理、進階與工程實踐
- SQL Server深入詳解
- Access 2010數據庫程序設計實踐教程
- Visual Studio 2012 and .NET 4.5 Expert Development Cookbook
- NoSQL數據庫原理(第2版·微課版)