
Getting to know your data

For many years, researchers argued about which matters more: data or algorithms. Today, ML specialists generally accept that data matters more. In most cases, whoever has better data beats those with more advanced algorithms. Garbage in, garbage out: this rule holds true in ML more than anywhere else. To succeed in this domain, one needs not only to have data, but also to know that data and know what to do with it.

ML datasets are usually composed of individual observations, called samples, cases, or data points. In the simplest case, each sample is described by several features, that is, measurable properties of the observation.
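
To make the terminology concrete, here is a minimal sketch of such a dataset in plain Python. The feature names and values are invented purely for illustration and do not come from any real dataset; the helper `column` is likewise hypothetical. Each inner list is one sample, and each position in it holds one feature.

```python
# Hypothetical toy dataset: every row is one sample (observation),
# every column is one feature. All names and values are made up.
feature_names = ["length_cm", "weight_kg", "color"]
samples = [
    [12.0, 0.4, "green"],
    [30.5, 2.1, "brown"],
    [18.2, 0.9, "green"],
]

n_samples = len(samples)         # number of observations
n_features = len(feature_names)  # number of features per sample


def column(name):
    """Return the values of one feature across all samples."""
    idx = feature_names.index(name)
    return [row[idx] for row in samples]


print(n_samples, n_features)
print(column("weight_kg"))
```

Here the dataset is a simple table: reading across a row gives everything known about one sample, while reading down a column gives one feature for all samples.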
