官术网_书友最值得收藏!

Deciding whether to train outdoors depending on the weather

Let's suppose we have historical data on the decisions made by an experienced football trainer about training outdoors (outside the gym) or not with her team, including the weather conditions on the days when the decisions were made.

A typical dataset could look as follows:

The dataset was specifically created for this example and, of course, might not represent any real decisions.

In this example, the target variable is Train outside and the rest of the variables are the model features.

According to the data table, a possible decision tree would be as follows:

We choose to start splitting the data by the value of the Outlook feature. We can see that if the value is Overcast, then the decision to train outside is always Yes and does not depend on the values of the other features. Sunny and Rainy can be further split to get an answer. 

How can we decide which feature to use first and how to continue? We will use the value of the entropy, measuring how much its value changes when considering different input features.

主站蜘蛛池模板: 顺昌县| 沈丘县| 巴青县| 天峨县| 沅陵县| 日照市| 育儿| 香格里拉县| 岐山县| 湖南省| 泽普县| 广丰县| 横山县| 吉安县| 车致| 广德县| 天气| 闽侯县| 通山县| 宜宾县| 大关县| 堆龙德庆县| 襄垣县| 瑞金市| 雷波县| 喜德县| 夏河县| 新建县| 博罗县| 兴宁市| 西充县| 蓝山县| 上饶县| 邹城市| 章丘市| 临猗县| 楚雄市| 白玉县| 保靖县| 江都市| 拜城县|