官术网_书友最值得收藏!

Exploration

Exploration involves being able to interactively slice and dice your data to try and make quick discoveries. Exploration can include various tasks such as:

  • Examining how variables relate to each other
  • Determining how the data is distributed
  • Finding and excluding outliers
  • Creating quick visualizations
  • Quickly creating new data representations or models to feed into more permanent and detailed modeling processes

Exploration is one of the great strengths of pandas. While exploration can be performed in most programming languages, each has its own level of ceremony—how much non-exploratory effort must be performedbefore actually getting to discoveries.

When used with the read-eval-print-loop (REPL) nature of IPython and/or Jupyter notebooks, pandas creates an exploratory environment that is almost free of ceremony. The expressiveness of the syntax of pandas lets you describe complex data manipulation constructs succinctly, and the result of every action you take upon your data is immediately presented for your inspection. This allows you to quickly determine the validity of the action you just took without having to recompile and completely rerun your programs.

主站蜘蛛池模板: 永登县| 彰化县| 康保县| 舒城县| 呼图壁县| 丹东市| 凤台县| 泾阳县| 平山县| 西乌珠穆沁旗| 土默特左旗| 茶陵县| 石柱| 南通市| 云林县| 赤水市| 抚顺县| 化隆| 康平县| 巨鹿县| 健康| 湖南省| 宁海县| 调兵山市| 尼勒克县| 石屏县| 牡丹江市| 沁水县| 得荣县| 德兴市| 屯门区| 彭山县| 甘南县| 新源县| 泌阳县| 津南区| 云浮市| 鸡泽县| 宁远县| 巫溪县| 万全县|