- Hands-On Data Science with Anaconda
- Dr. Yuxing Yan James Yan
- 167字
- 2021-06-25 21:08:48
Data Basics
In this chapter, we'll first discuss sources of open data, which includes the University of California at Irvine (UCI) Machine Learning Depository, the Bureau of Labor Statistics, the Census Bureau, Professor French's Data Library, and the Federal Reserve's Data Library. Then, we will show you several ways of inputting data, how to deal with missing values, sorting, choosing a subset, merging different datasets, and data output. For different languages, such as Python, R, and Julia, several relevant packages for data manipulation will be introduced as well. In particular, the Python pandas package will be discussed.
In this chapter, the following topics will be covered:
- Sources of data
- Introduction to the Python pandas package
- Several ways to inputting packages
- Introduction to the Quandl data delivery platform
- Dealing with missing data
- Sorting data, as well as how to slice, dice, and merge various datasets
- Introduction to Python packages: cbsodata and datadotword
- Introduction to R packages: dslabs, haven, and foreign
- Generating Python datasets
- Generating R datasets
推薦閱讀
- Mastering Hadoop 3
- Mastering Proxmox(Third Edition)
- TIBCO Spotfire:A Comprehensive Primer(Second Edition)
- 條碼技術及應用
- ROS機器人編程與SLAM算法解析指南
- 基于多目標決策的數據挖掘方法評估與應用
- Microsoft System Center Confi guration Manager
- SMS 2003部署與操作深入指南
- 生物3D打印:從醫療輔具制造到細胞打印
- 電腦故障排除與維護終極技巧金典
- 51單片機應用程序開發與實踐
- 我的IT世界
- 實戰Hadoop
- 機器人輔助C程序設計
- Pentaho Data Integration Beginner's Guide(Second Edition)