官术网_书友最值得收藏!

Making predictions

We use the predict function to get outcome labels for two samples. The first one is light-black, fluffy creature, 24 meters long. The second one is purple polka dot, non-fluffy, and 34 meters long. If you already don't remember the meaning of each feature, consult the feature_names variable:

In []: 
samples = [[24,1,0,1,0,0], [34,0,0,0,1,0]] 
tree_model.predict(samples) 
Out[]: 
array([u'platyhog', u'rabbosaurus'], dtype=object) 

Our model predicted platyhog for the first sample, and rabbosaurus for the second one. A decision tree can also provide probabilistic output (how sure it is about the prediction):

In []: 
tree_model.predict_proba(samples) 
Out[]: 
array([[ 1.,  0.], 
       [ 0.,  1.]]) 

The array contains two nested arrays, one for every prediction. Elements in the nested arrays are probabilities of the sample belonging to the corresponding class. This means that our model is 100% sure that the first sample belongs to the first class, and 100% sure that the second sample belongs to the second class.

But how sure can we be about these predictions? We have a whole set of different tools to evaluate the model's accuracy, and the simplest one is the built-in scoring functions.

主站蜘蛛池模板: 昭通市| 尖扎县| 镇安县| 澎湖县| 古蔺县| 霍城县| 兴安县| 宣城市| 嘉祥县| 兴安盟| 新和县| 万州区| 河间市| 瓦房店市| 旅游| 巴青县| 绵竹市| 丹东市| 会同县| 喀喇| 屏东市| 新营市| 蓝山县| 当雄县| 奉新县| 临猗县| 大名县| 金平| 东乌珠穆沁旗| 巢湖市| 阳山县| 离岛区| 澎湖县| 赤峰市| 新泰市| 屏山县| 镇原县| 丹东市| 商都县| 永清县| 沙雅县|