官术网_书友最值得收藏!

Balanced dataset

The application allows you to record samples of different motion types. As you train the model, you may notice one interesting effect: to get accurate predictions, you need not only enough samples, but you also need the proportion of different classes in your dataset to be roughly equal. Think about it: if you have 100 samples of two classes (walk and run), and 99 of them belong to one class (walk), the classifier that delivers 99% accuracy may look like this:

func predict(x: [Double]) -> MotionType { 
   return .walk 
} 

But this is not what we want, obviously.

This observation lead us to the notion of the balanced data set; for most machine learning algorithms, you want the data set in which samples of different classes are represented equally frequently.

主站蜘蛛池模板: 长岛县| 江油市| 哈密市| 郴州市| 黑河市| 张家川| 舟曲县| 古丈县| 天祝| 上栗县| 陇南市| 奉新县| 富川| 洞口县| 柳林县| 芦山县| 莲花县| 宜阳县| 三河市| 兰考县| 兴隆县| 毕节市| 庆阳市| 晴隆县| 独山县| 汉中市| 彰化县| 上虞市| 龙口市| 浦县| 蓝田县| 延边| 黄山市| 松溪县| 齐齐哈尔市| 大竹县| 兰考县| 四平市| 富民县| 南澳县| 郯城县|