官术网_书友最值得收藏!

Binning

Sometimes it's useful to separate feature values into several bins. For example, we may be only interested whether it rained on a particular day. Given the precipitation values, we can binarize the values, so that we get a true value if the precipitation value is not zero, and a false value otherwise. We can also use statistics to divide values into high, low, and medium bins.

The binning process inevitably leads to loss of information. However, depending on your goals this may not be an issue, and actually reduce the chance of overfitting. Certainly there will be improvements in speed and memory or storage requirements.

主站蜘蛛池模板: 岗巴县| 秭归县| 蓬安县| 尉氏县| 博乐市| 祁连县| 大同市| 枞阳县| 康乐县| 临安市| 武清区| 怀化市| 慈利县| 诸暨市| 泽州县| 长泰县| 汕头市| 榕江县| 岗巴县| 工布江达县| 盖州市| 喀喇| 邮箱| 舟曲县| 大冶市| 莱西市| 襄汾县| 广安市| 法库县| 紫金县| 兖州市| 宜丰县| 北川| 松阳县| 工布江达县| 峡江县| 广饶县| 鲁甸县| 永昌县| 琼中| 康平县|