官术网_书友最值得收藏!

Balanced dataset

The application allows you to record samples of different motion types. As you train the model, you may notice one interesting effect: to get accurate predictions, you need not only enough samples, but you also need the proportion of different classes in your dataset to be roughly equal. Think about it: if you have 100 samples of two classes (walk and run), and 99 of them belong to one class (walk), the classifier that delivers 99% accuracy may look like this:

func predict(x: [Double]) -> MotionType { 
   return .walk 
} 

But this is not what we want, obviously.

This observation lead us to the notion of the balanced data set; for most machine learning algorithms, you want the data set in which samples of different classes are represented equally frequently.

主站蜘蛛池模板: 淮南市| 文成县| 永定县| 苍山县| 石台县| 马关县| 郧西县| 军事| 兴义市| 利辛县| 噶尔县| 铁岭县| 岑巩县| 包头市| 丹棱县| 垣曲县| 屯昌县| 石景山区| 桃江县| 报价| 城固县| 潜山县| 那坡县| 冷水江市| 大安市| 无锡市| 寿宁县| 嘉荫县| 政和县| 曲阜市| 咸丰县| 远安县| 黄冈市| 红安县| 北票市| 婺源县| 周口市| 万年县| 达尔| 安多县| 汉源县|