官术网_书友最值得收藏!

Deciding whether to train outdoors depending on the weather

Let's suppose we have historical data on the decisions made by an experienced football trainer about training outdoors (outside the gym) or not with her team, including the weather conditions on the days when the decisions were made.

A typical dataset could look as follows:

The dataset was specifically created for this example and, of course, might not represent any real decisions.

In this example, the target variable is Train outside and the rest of the variables are the model features.

According to the data table, a possible decision tree would be as follows:

We choose to start splitting the data by the value of the Outlook feature. We can see that if the value is Overcast, then the decision to train outside is always Yes and does not depend on the values of the other features. Sunny and Rainy can be further split to get an answer. 

How can we decide which feature to use first and how to continue? We will use the value of the entropy, measuring how much its value changes when considering different input features.

主站蜘蛛池模板: 开平市| 张家港市| 丘北县| 平果县| 玉龙| 通城县| 宁国市| 广宗县| 昌都县| 福建省| 大竹县| 胶南市| 札达县| 沈阳市| 古丈县| 汉中市| 宁城县| 十堰市| 绥阳县| 饶平县| 景宁| 通化县| 临邑县| 濮阳县| 普格县| 疏勒县| 刚察县| 朝阳县| 梁山县| 高青县| 明水县| 敖汉旗| 武威市| 苍溪县| 顺平县| 大同市| 海安县| 阿尔山市| 尤溪县| 育儿| 德庆县|