官术网_书友最值得收藏!

Exploration

Exploration involves being able to interactively slice and dice your data to try and make quick discoveries. Exploration can include various tasks such as:

  • Examining how variables relate to each other
  • Determining how the data is distributed
  • Finding and excluding outliers
  • Creating quick visualizations
  • Quickly creating new data representations or models to feed into more permanent and detailed modeling processes

Exploration is one of the great strengths of pandas. While exploration can be performed in most programming languages, each has its own level of ceremony—how much non-exploratory effort must be performedbefore actually getting to discoveries.

When used with the read-eval-print-loop (REPL) nature of IPython and/or Jupyter notebooks, pandas creates an exploratory environment that is almost free of ceremony. The expressiveness of the syntax of pandas lets you describe complex data manipulation constructs succinctly, and the result of every action you take upon your data is immediately presented for your inspection. This allows you to quickly determine the validity of the action you just took without having to recompile and completely rerun your programs.

主站蜘蛛池模板: 岳池县| 徐水县| 潜江市| 石台县| 沧州市| 新晃| 内乡县| 博野县| 宁乡县| 临城县| 双流县| 肃北| 云梦县| 乌拉特后旗| 甘肃省| 临沭县| 开阳县| 额济纳旗| 宜春市| 周口市| 桐梓县| 洛川县| 阿鲁科尔沁旗| 元谋县| 特克斯县| 集安市| 南靖县| 平舆县| 微山县| 开封县| 新干县| 通城县| 进贤县| 双鸭山市| 宜春市| 德州市| 伊春市| 定襄县| 永胜县| 济源市| 育儿|