- Python Machine Learning By Example
- Yuxi (Hayden) Liu
- 106字
- 2021-07-02 22:57:19
Binning
Sometimes it's useful to separate feature values into several bins. For example, we may be only interested whether it rained on a particular day. Given the precipitation values, we can binarize the values, so that we get a true value if the precipitation value is not zero, and a false value otherwise. We can also use statistics to divide values into high, low, and medium bins.
The binning process inevitably leads to loss of information. However, depending on your goals this may not be an issue, and actually reduce the chance of overfitting. Certainly there will be improvements in speed and memory or storage requirements.
推薦閱讀
- 程序員面試筆試寶典(第3版)
- Learning C++ Functional Programming
- Wireshark Network Security
- Practical Windows Forensics
- Python Data Analysis(Second Edition)
- Apache Mahout Clustering Designs
- Swift細致入門與最佳實踐
- 大數據分析與應用實戰:統計機器學習之數據導向編程
- Terraform:多云、混合云環境下實現基礎設施即代碼(第2版)
- Vue.js 2 Web Development Projects
- Django 3.0入門與實踐
- Odoo 10 Implementation Cookbook
- Everyday Data Structures
- Unity Android Game Development by Example Beginner's Guide
- Python Web自動化測試設計與實現