- Mastering Machine Learning with R
- Cory Lesmeister
- 220字
- 2021-07-02 13:46:18
Preparing and Understanding Data
Research consistently shows that machine learning and data science practitioners spend most of their time manipulating data and preparing it for analysis. Indeed, many find it the most tedious and least enjoyable part of their work. Numerous companies are offering solutions to the problem but, in my opinion, results at this point are varied. Therefore, in this first chapter, I shall endeavor to provide a way of tackling the problem that will ease the burden of getting your data ready for machine learning. The methodology introduced in this chapter will serve as the foundation for data preparation and for understanding many of the subsequent chapters. I propose that once you become comfortable with this tried and true process, it may very well become your favorite part of machine learning—as it is for me.
The following are the topics that we'll cover in this chapter:
- Overview
- Reading the data
- Handling duplicate observations
- Descriptive statistics
- Exploring categorical variables
- Handling missing values
- Zero and near-zero variance features
- Treating the data
- Correlation and linearity
- LabVIEW虛擬儀器從入門到測控應用130例
- 21天學通PHP
- 機器學習與大數據技術
- RPA:流程自動化引領數字勞動力革命
- Visual C++編程全能詞典
- INSTANT Autodesk Revit 2013 Customization with .NET How-to
- Ruby on Rails敏捷開發最佳實踐
- 中國戰略性新興產業研究與發展·智能制造裝備
- Building a BeagleBone Black Super Cluster
- 電腦上網輕松入門
- 軟件構件技術
- Mastering pfSense
- Mastering OpenStack(Second Edition)
- Windows 7來了
- Access 2007數據庫入門與實例應用金典