- Data Science for Marketing Analytics
- Tommy Blanchard Debasish Behera Pranshu Bhatnagar
- 188字
- 2021-06-11 13:45:52
Summary
Data processing and wrangling is the initial, and a very important, part of the data science pipeline. It is generally helpful if people preparing data have some domain knowledge about the data, since that will help them stop at the right processing point and use their intuition to build the pipeline better and more quickly. Data processing also requires coming up with innovative solutions and hacks.
In this chapter, you learned how to structure large datasets by arranging them in a tabular form. Then, we got this tabular data into pandas and distributed it between the right columns. Once we were sure that our data was arranged correctly, we combined it with other data sources. We also got rid of duplicates and needless columns, and finally, dealt with missing data. After performing these steps, our data was made ready for analysis and could be put into a data science pipeline directly.
In the next chapter, we will deepen our understanding of pandas and talk about reshaping and analyzing DataFrames for better visualizations and summarizing data. We will also see how to directly solve generic business-critical problems efficiently.
- 機器學習及應用(在線實驗+在線自測)
- Circos Data Visualization How-to
- 蕩胸生層云:C語言開發修行實錄
- WordPress Theme Development Beginner's Guide(Third Edition)
- 自動生產線的拆裝與調試
- 運動控制器與交流伺服系統的調試和應用
- Excel 2007技巧大全
- 云原生架構進階實戰
- SMS 2003部署與操作深入指南
- Machine Learning Algorithms(Second Edition)
- 基于ARM9的小型機器人制作
- 網絡脆弱性掃描產品原理及應用
- 重估:人工智能與賦能社會
- Mastering MongoDB 4.x
- Kubernetes on AWS