- Advanced Machine Learning with R
- Cory Lesmeister Dr. Sunil Kumar Chinnamgari
- 220字
- 2021-06-24 14:24:32
Preparing and Understanding Data
Research consistently shows that machine learning and data science practitioners spend most of their time manipulating data and preparing it for analysis. Indeed, many find it the most tedious and least enjoyable part of their work. Numerous companies are offering solutions to the problem but, in my opinion, results at this point are varied. Therefore, in this first chapter, I shall endeavor to provide a way of tackling the problem that will ease the burden of getting your data ready for machine learning. The methodology introduced in this chapter will serve as the foundation for data preparation and for understanding many of the subsequent chapters. I propose that once you become comfortable with this tried and true process, it may very well become your favorite part of machine learning—as it is for me.
The following are the topics that we'll cover in this chapter:
- Overview
- Reading the data
- Handling duplicate observations
- Descriptive statistics
- Exploring categorical variables
- Handling missing values
- Zero and near-zero variance features
- Treating the data
- Correlation and linearity
- Intel FPGA/CPLD設計(基礎篇)
- 筆記本電腦使用、維護與故障排除實戰(zhàn)
- 圖解西門子S7-200系列PLC入門
- 電腦組裝與維修從入門到精通(第2版)
- 嵌入式系統(tǒng)設計教程
- 單片機原理及應用系統(tǒng)設計
- Hands-On Machine Learning with C#
- Machine Learning Solutions
- 深入理解序列化與反序列化
- Python Machine Learning Blueprints
- IP網(wǎng)絡視頻傳輸:技術、標準和應用
- 嵌入式系統(tǒng)原理及應用:基于ARM Cortex-M4體系結構
- USB應用開發(fā)寶典
- The Applied Artificial Intelligence Workshop
- Service Mesh微服務架構設計