
Binning

Sometimes it's useful to separate feature values into several bins. For example, we may be interested only in whether it rained on a particular day. Given the precipitation values, we can binarize them, so that we get a true value if the precipitation is not zero, and a false value otherwise. We can also use statistics, such as percentiles, to divide values into low, medium, and high bins.
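As a minimal sketch of both ideas, the following NumPy snippet binarizes a small array of precipitation values and then assigns each value to a low, medium, or high bin. The data is made up for illustration, and the choice of the 33rd and 67th percentiles as bin edges is an assumption, not a rule.

```python
import numpy as np

# Hypothetical daily precipitation values, made up for illustration.
precipitation = np.array([0.0, 0.0, 1.2, 7.5, 0.3, 0.0, 12.4, 3.1])

# Binarize: True if any precipitation was recorded, False otherwise.
rained = precipitation > 0

# Three bins: use the 33rd and 67th percentiles as edges (an assumed choice).
edges = np.percentile(precipitation, [33, 67])

# np.digitize returns 0 for values below the first edge, 1 for values
# between the edges, and 2 for values at or above the second edge.
bin_index = np.digitize(precipitation, edges)
levels = np.array(["low", "medium", "high"])[bin_index]

print(rained)
print(levels)
```

Other edge choices, such as fixed thresholds or equal-width intervals, work the same way: only the `edges` array changes.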

The binning process inevitably leads to a loss of information. However, depending on your goals, this may not be an issue, and it may actually reduce the chance of overfitting. Binned values also typically take less memory or storage, and can be faster to process.
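The storage effect can be checked directly. This sketch, which assumes NumPy's default `float64` type for the raw values and an arbitrary threshold of 5, compares the bytes used by an array before and after binarizing it.

```python
import numpy as np

# 1000 evenly spaced values stored as float64 (8 bytes each).
values = np.linspace(0, 10, 1000)

# Binarized version stored as bool (1 byte each in NumPy).
binary = values > 5  # the threshold 5 is arbitrary, for illustration only

print(values.nbytes)  # bytes used by the raw values
print(binary.nbytes)  # bytes used by the binarized values
```

The binarized array can no longer tell 5.1 from 9.9, which is the information loss described above.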
