官术网_书友最值得收藏!

Balanced dataset

The application allows you to record samples of different motion types. As you train the model, you may notice one interesting effect: to get accurate predictions, you need not only enough samples, but you also need the proportion of different classes in your dataset to be roughly equal. Think about it: if you have 100 samples of two classes (walk and run), and 99 of them belong to one class (walk), the classifier that delivers 99% accuracy may look like this:

func predict(x: [Double]) -> MotionType { 
   return .walk 
} 

But this is not what we want, obviously.

This observation lead us to the notion of the balanced data set; for most machine learning algorithms, you want the data set in which samples of different classes are represented equally frequently.

主站蜘蛛池模板: 喀什市| 南召县| 扶沟县| 通渭县| 威远县| 安阳县| 浦东新区| 乾安县| 梅河口市| 桐庐县| 台中县| 彩票| 上杭县| 资中县| 南京市| 红安县| 旌德县| 高要市| 仙游县| 遂溪县| 昌吉市| 五大连池市| 长顺县| 阳原县| 团风县| 若尔盖县| 循化| 云浮市| 临夏市| 庆元县| 河源市| 马龙县| 龙口市| 曲麻莱县| 龙江县| 郁南县| 微博| 邢台县| 武城县| 喀喇沁旗| 邛崃市|