官术网_书友最值得收藏!

Getting to know your data

For many years, researchers argued about what is more important: data or algorithms. But now, it looks like the importance of data over algorithms is generally accepted among ML specialists. In most cases, we can assume that the one who has better data usually beats those with more advanced algorithms. Garbage in, garbage out—this rule holds true in ML more than anywhere else. To succeed in this domain, one need not only have data, but also needs to know his data and know what to do with it.

ML datasets are usually composed from individual observations, called samples, cases, or data points. In the simplest case, each sample has several features.

主站蜘蛛池模板: 临西县| 高台县| 翁源县| 莱州市| 临高县| 大方县| 保山市| 铁岭县| 泸溪县| 黄陵县| 乐业县| 平度市| 泰顺县| 白河县| 龙门县| 德清县| 祁连县| 梓潼县| 灵璧县| 合水县| 桂平市| 图们市| 禄丰县| 台前县| 嵩明县| 绍兴县| 利辛县| 康平县| 合山市| 交城县| 博罗县| 体育| 桂林市| 东乌珠穆沁旗| 资中县| 会泽县| 浦县| 交城县| 鸡东县| 五大连池市| 贺州市|