官术网_书友最值得收藏!

Chapter 1. Preparing the Data

In this chapter, we will cover the basic tasks of reading, storing, and cleaning data using Python and OpenRefine. You will learn the following recipes:

  • Reading and writing CSV/TSV files with Python
  • Reading and writing JSON files with Python
  • Reading and writing Excel files with Python
  • Reading and writing XML files with Python
  • Retrieving HTML pages with pandas
  • Storing and retrieving from a relational database
  • Storing and retrieving from MongoDB
  • Opening and transforming data with OpenRefine
  • Exploring the data with OpenRefine
  • Removing duplicates
  • Using regular expressions and GREL to clean up the data
  • Imputing missing observations
  • Normalizing and standardizing features
  • Binning the observations
  • Encoding categorical variables
主站蜘蛛池模板: 鄂托克前旗| 海兴县| 长沙县| 彭州市| 陆良县| 思茅市| 仙桃市| 安福县| 游戏| 睢宁县| 库车县| 乐陵市| 大庆市| 延川县| 天全县| 孙吴县| 监利县| 尉犁县| 龙井市| 班戈县| 苏州市| 东阿县| 台江县| 五寨县| 郯城县| 花垣县| 宣武区| 武陟县| 朝阳区| 巴林右旗| 临朐县| 九江县| 华宁县| 泊头市| 靖安县| 莒南县| 高安市| 株洲县| 石城县| 郯城县| 枣强县|