Data Basics

In this chapter, we'll first discuss sources of open data, including the University of California at Irvine (UCI) Machine Learning Repository, the Bureau of Labor Statistics, the Census Bureau, Professor French's Data Library, and the Federal Reserve's Data Library. Then, we will show you several ways of inputting data, how to deal with missing values, how to sort data, choose a subset, and merge different datasets, and how to output data. Several relevant data manipulation packages for different languages, such as Python, R, and Julia, will be introduced as well. In particular, the Python pandas package will be discussed.

In this chapter, the following topics will be covered:

  • Sources of data
  • Introduction to the Python pandas package
  • Several ways of inputting data
  • Introduction to the Quandl data delivery platform
  • Dealing with missing data
  • Sorting data, as well as how to slice, dice, and merge various datasets
  • Introduction to Python packages: cbsodata and datadotworld
  • Introduction to R packages: dslabs, haven, and foreign
  • Generating Python datasets
  • Generating R datasets
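
As a quick preview of the pandas workflow listed above, here is a minimal sketch that inputs data, handles a missing value, sorts, takes a subset, merges two datasets, and writes the result out. The tickers, prices, and sectors are made-up illustration values held in memory rather than data from any of the sources named in this chapter, and the variable names are hypothetical.

```python
import io

import pandas as pd

# Hypothetical in-memory CSV text standing in for downloaded files.
prices_csv = io.StringIO(
    "ticker,price\n"
    "IBM,150.0\n"
    "MSFT,\n"          # the price is missing for this row
    "AAPL,120.0\n"
)
sectors_csv = io.StringIO(
    "ticker,sector\n"
    "AAPL,Tech\n"
    "IBM,Tech\n"
    "WMT,Retail\n"
)

# Input: read both datasets into DataFrames.
prices = pd.read_csv(prices_csv)
sectors = pd.read_csv(sectors_csv)

# Missing data: count the missing prices, then drop those rows.
n_missing = int(prices["price"].isna().sum())
clean = prices.dropna(subset=["price"])

# Sorting: order the remaining rows by price, lowest first.
sorted_prices = clean.sort_values("price")

# Subset: keep only rows whose price exceeds a threshold.
subset = sorted_prices[sorted_prices["price"] > 130]

# Merge: keep only the tickers present in both datasets.
merged = pd.merge(sorted_prices, sectors, on="ticker", how="inner")

# Output: write the merged data as CSV text (a file path also works).
out = io.StringIO()
merged.to_csv(out, index=False)
csv_text = out.getvalue()
print(csv_text)
```

Dropping rows is only one way to treat missing values; filling them with `fillna()` is another option, and the better choice depends on the dataset.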