- Hands-On Machine Learning with JavaScript
- Burak Kanber
- 212字
- 2021-06-25 21:38:21
Cleaning and preparing data
Feature selection is not the only consideration required when preprocessing your data. There are many other things that you may need to do to prepare your data for the algorithm that will ultimately analyze the data. Perhaps there are measurement errors that create significant outliers. There can also be instrumentation noise in the data that needs to be smoothed out. Your data may have missing values for some features. These are all issues that can either be ignored or addressed, depending, as always, on the context, the data, and the algorithm involved.
Additionally, the algorithm you use may require the data to be normalized to some range of values. Or perhaps your data is in a different format that the algorithm cannot use, as is often the case with neural networks which expect you to provide a vector of values, but you have JSON objects that come from a database. Sometimes you need to analyze only a specific subset of data from a larger source. If you're working with images you may need to resize, scale, pad, crop, or reduce the image to grayscale.
These tasks all fall into the realm of data preprocessing. Let's take a look at some specific scenarios and discuss possible approaches for each.
- Excel 2007函數與公式自學寶典
- Spark編程基礎(Scala版)
- Dreamweaver 8中文版商業案例精粹
- Security Automation with Ansible 2
- Google App Inventor
- Maya 2012從入門到精通
- STM32嵌入式微控制器快速上手
- 計算機網絡安全
- 電腦主板現場維修實錄
- Machine Learning with Apache Spark Quick Start Guide
- LMMS:A Complete Guide to Dance Music Production Beginner's Guide
- 和機器人一起進化
- 典型Hadoop云計算
- 貫通Hibernate開發
- Instant Slic3r