官术网_书友最值得收藏!

Loading the Data into Jupyter Using a Pandas DataFrame

Oftentimes, data is stored in tables, which means it can be saved as a comma-separated variable (CSV) file. This format, and many others, can be read into Python as a DataFrame object, using the Pandas library. Other common formats include tab-separated variable (TSV), SQL tables, and JSON data structures. Indeed, Pandas has support for all of these. In this example, however, we are not going to load the data this way because the dataset is available directly through scikit-learn.

An important part after loading data for analysis is ensuring that it's clean. For example, we would generally need to deal with missing data and ensure that all columns have the correct datatypes. The dataset we use in this section has already been cleaned, so we will not need to worry about this. However, we'll see messier data in the second chapter and explore techniques for dealing with it.
主站蜘蛛池模板: 石屏县| 柯坪县| 壤塘县| 阳泉市| 平顺县| 淅川县| 手机| 平南县| 赤城县| 丽江市| 成武县| 井陉县| 梁山县| 聂拉木县| 金华市| 巩留县| 德清县| 濮阳市| 河北区| 永康市| 逊克县| 连州市| 清涧县| 凤山市| 金昌市| 绥德县| 菏泽市| 凌海市| 迁安市| 渭南市| 横山县| 定远县| 海兴县| 宁远县| 新蔡县| 天台县| 满城县| 青海省| 宝鸡市| 灯塔市| 探索|