Data Basics

In this chapter, we'll first discuss sources of open data, which include the University of California at Irvine (UCI) Machine Learning Repository, the Bureau of Labor Statistics, the Census Bureau, Professor French's Data Library, and the Federal Reserve's Data Library. Then, we will show you several ways of inputting data, and how to deal with missing values, sort data, choose a subset, merge different datasets, and output data. We will also introduce several relevant data manipulation packages for Python, R, and Julia, with particular attention to the Python pandas package.

In this chapter, the following topics will be covered:

  • Sources of data
  • Introduction to the Python pandas package
  • Several ways of inputting data
  • Introduction to the Quandl data delivery platform
  • Dealing with missing data
  • Sorting data, as well as how to slice, dice, and merge various datasets
  • Introduction to Python packages: cbsodata and datadotworld
  • Introduction to R packages: dslabs, haven, and foreign
  • Generating Python datasets
  • Generating R datasets