官术网_书友最值得收藏!

Balanced dataset

The application allows you to record samples of different motion types. As you train the model, you may notice one interesting effect: to get accurate predictions, you need not only enough samples, but you also need the proportion of different classes in your dataset to be roughly equal. Think about it: if you have 100 samples of two classes (walk and run), and 99 of them belong to one class (walk), the classifier that delivers 99% accuracy may look like this:

func predict(x: [Double]) -> MotionType { 
   return .walk 
} 

But this is not what we want, obviously.

This observation lead us to the notion of the balanced data set; for most machine learning algorithms, you want the data set in which samples of different classes are represented equally frequently.

主站蜘蛛池模板: 铅山县| 盐城市| 万年县| 永年县| 渑池县| 武强县| 乌兰浩特市| 玉溪市| 大同市| 吴堡县| 察隅县| 彝良县| 兴宁市| 辽中县| 运城市| 云龙县| 江油市| 台江县| 通城县| 南昌市| 兰西县| 香格里拉县| 枝江市| 霍山县| 任丘市| 永寿县| 光泽县| 神木县| 文登市| 耿马| 富锦市| 宁南县| 通河县| 吉木萨尔县| 蓝山县| 堆龙德庆县| 泗阳县| 新竹县| 宜丰县| 隆昌县| 武山县|