- Python Machine Learning By Example
- Yuxi (Hayden) Liu
- 143字
- 2021-07-02 22:57:19
Scaling
Values of different features can differ by orders of magnitude. Sometimes this may mean that the larger values dominate the smaller values. This depends on the algorithm we are using. For certain algorithms to work properly we are required to scale the data. There are several common strategies that we can apply:
- Standardization removes the mean of a feature and divides by the standard deviation. If the feature values are normally distributed, we will get a Gaussian, which is centered around zero with a variance of one.
- If the feature values are not normally distributed, we can remove the median and divide by the interquartile range. The interquartile range is a range between the first and third quartile (or 25th and 75th percentile).
- Scaling features to a range is a common choice of range which is a range between zero and one.
推薦閱讀
- ASP.NET Core:Cloud-ready,Enterprise Web Application Development
- UI設(shè)計(jì)基礎(chǔ)培訓(xùn)教程
- Spring 5.0 Microservices(Second Edition)
- Learn Type:Driven Development
- Mastering QGIS
- Python王者歸來
- 精通網(wǎng)絡(luò)視頻核心開發(fā)技術(shù)
- ASP.NET開發(fā)與應(yīng)用教程
- OpenResty完全開發(fā)指南:構(gòu)建百萬級別并發(fā)的Web應(yīng)用
- Web前端應(yīng)用開發(fā)技術(shù)
- Python Interviews
- Python自然語言理解:自然語言理解系統(tǒng)開發(fā)與應(yīng)用實(shí)戰(zhàn)
- Learning Android Application Testing
- Python Programming for Arduino
- Python程序設(shè)計(jì)教程