- Scala for Machine Learning(Second Edition)
- Patrick R. Nicolas
- 165字
- 2021-07-08 10:43:06
Chapter 3. Data Preprocessing
Real-world observations are usually noisy and inconsistent, with missing data. No classification, regression, or clustering model can extract reliable information from data that has not been cleansed, filtered, or analyzed.
Data preprocessing consists of cleaning, filtering, transforming, and normalizing raw observations using statistics in order to correlate features or groups of features, identify trends, model, and filter out noise. The purpose of cleansing raw data is twofold:
- Identify flaws in raw input data
- Provide unsupervised or supervised learning with a clean and reliable dataset
You should not underestimate the power of traditional statistical analysis methods to infer and classify information from textual or unstructured data.
In this chapter, you will learn how to to the following:
- Apply commonly used moving average techniques to detect long-term trends in a time series
- Identify market and sector cycles using the discrete Fourier series
- Leverage the discrete Kalman filter to extract the state of a linear dynamic system from incomplete and noisy observations
推薦閱讀
- C#高級編程(第10版) C# 6 & .NET Core 1.0 (.NET開發經典名著)
- Linux核心技術從小白到大牛
- Unity 2020 Mobile Game Development
- 深入RabbitMQ
- Mastering C++ Multithreading
- Learning C++ by Creating Games with UE4
- Python編程基礎教程
- INSTANT LESS CSS Preprocessor How-to
- C語言程序設計
- Java程序設計(項目教學版)
- MATLAB/Simulink建模與仿真
- Unity Certified Programmer:Exam Guide
- C#入門經典(第7版):C# 6.0 & Visual Studio 2015(.NET開發經典名著)
- 大學計算機基礎
- 小學生Python創意編程(視頻教學版)