- Hands-On Data Science with Anaconda
- Dr. Yuxing Yan James Yan
- 167字
- 2021-06-25 21:08:48
Data Basics
In this chapter, we'll first discuss sources of open data, which includes the University of California at Irvine (UCI) Machine Learning Depository, the Bureau of Labor Statistics, the Census Bureau, Professor French's Data Library, and the Federal Reserve's Data Library. Then, we will show you several ways of inputting data, how to deal with missing values, sorting, choosing a subset, merging different datasets, and data output. For different languages, such as Python, R, and Julia, several relevant packages for data manipulation will be introduced as well. In particular, the Python pandas package will be discussed.
In this chapter, the following topics will be covered:
- Sources of data
- Introduction to the Python pandas package
- Several ways to inputting packages
- Introduction to the Quandl data delivery platform
- Dealing with missing data
- Sorting data, as well as how to slice, dice, and merge various datasets
- Introduction to Python packages: cbsodata and datadotword
- Introduction to R packages: dslabs, haven, and foreign
- Generating Python datasets
- Generating R datasets
推薦閱讀
- 腦動力:C語言函數(shù)速查效率手冊
- 大數(shù)據(jù)專業(yè)英語
- TIBCO Spotfire:A Comprehensive Primer(Second Edition)
- PIC單片機(jī)C語言非常入門與視頻演練
- Hands-On Cybersecurity with Blockchain
- RPA(機(jī)器人流程自動化)快速入門:基于Blue Prism
- Visual C++編程全能詞典
- JavaScript典型應(yīng)用與最佳實(shí)踐
- 網(wǎng)絡(luò)化分布式系統(tǒng)預(yù)測控制
- 中國戰(zhàn)略性新興產(chǎn)業(yè)研究與發(fā)展·工業(yè)機(jī)器人
- 電氣控制與PLC技術(shù)應(yīng)用
- 邊緣智能:關(guān)鍵技術(shù)與落地實(shí)踐
- Visual C++項(xiàng)目開發(fā)案例精粹
- Raspberry Pi Projects for Kids
- Oracle 11g Anti-hacker's Cookbook