- Python Machine Learning By Example
- Yuxi (Hayden) Liu
- 136字
- 2021-07-02 12:41:35
Scaling
Values of different features can differ by orders of magnitude. Sometimes, this may mean that the larger values dominate the smaller values. This depends on the algorithm we're using. For certain algorithms to work properly, we're required to scale the data.
There are following several common strategies that we can apply:
- Standardization removes the mean of a feature and divides by the standard deviation. If the feature values are normally distributed, we'll get a Gaussian, which is centered around zero with a variance of one.
- If the feature values aren't normally distributed, we can remove the median and divide by the interquartile range. The interquartile range is a range between the first and third quartile (or 25th and 75th percentile).
- Scaling features to a range is a common choice of range between zero and one.
推薦閱讀
- LabVIEW虛擬儀器從入門到測控應(yīng)用130例
- 自動檢測與轉(zhuǎn)換技術(shù)
- STM32嵌入式微控制器快速上手
- 工業(yè)機(jī)器人操作與編程
- 基于32位ColdFire構(gòu)建嵌入式系統(tǒng)
- 網(wǎng)站前臺設(shè)計(jì)綜合實(shí)訓(xùn)
- Linux系統(tǒng)下C程序開發(fā)詳解
- 網(wǎng)絡(luò)服務(wù)器搭建與管理
- 青少年VEX IQ機(jī)器人實(shí)訓(xùn)課程(初級)
- 中老年人學(xué)電腦與上網(wǎng)
- 系統(tǒng)安裝、維護(hù)與數(shù)據(jù)備份技巧
- 歐姆龍PLC應(yīng)用系統(tǒng)設(shè)計(jì)實(shí)例精解
- Raspberry Pi 3 Projects for Java Programmers
- TensorFlow 2.0卷積神經(jīng)網(wǎng)絡(luò)實(shí)戰(zhàn)
- 巧學(xué)活用WPS