官术网_书友最值得收藏!

Preface

Data is a collection of discrete objects, events, and facts in the form of numbers, text, pictures, videos, objects, audio, and other entities. Processing data provides a great deal of information. But the million-dollar question is—how do we get meaningful information from data? The answer to this question is Exploratory Data Analysis (EDA), which is the process of investigating datasets, elucidating subjects, and visualizing outcomes. EDA is an approach to data analysis that applies a variety of techniques to maximize specific insights into a dataset, reveal an underlying structure, extract significant variables, detect outliers and anomalies, test assumptions, develop models, and determine best parameters for future estimations. This book, Hands-On Exploratory Data Analysis with Python, aims to provide practical knowledge about the main pillars of EDA, including data cleansing, data preparation, data exploration, and data visualization. Why visualization? Well, several research studies have shown that portraying data in graphical form makes complex statistical data analyses and business intelligence more marketable. 

You will get the opportunity to explore open source datasets including healthcare datasets, demographics datasets, a Titanic dataset, a wine quality dataset, automobile datasets, a Boston housing pricing dataset, and many others. Using these real-life datasets, you will get hands-on practice in understanding data, summarize data's characteristics, and visualizing data for business intelligence purposes. This book expects you to use pandas, a powerful library for working with data, and other core Python libraries including NumPy, scikit-learn, SciPyStatsModels for regression, and Matplotlib for visualization.

主站蜘蛛池模板: 普洱| 碌曲县| 固镇县| 正蓝旗| 平武县| 塘沽区| 濉溪县| 三门县| 南丰县| 古丈县| 大埔区| 饶河县| 徐州市| 阜宁县| 怀柔区| 朝阳区| 仁化县| 广丰县| 调兵山市| 清河县| 彝良县| 微山县| 廊坊市| 奉节县| 岳西县| 上思县| 呼玛县| 阳高县| 珠海市| 民勤县| 邵武市| 凤阳县| 钦州市| 宾川县| 阿坝| 台州市| 清新县| 横山县| 琼中| 枣强县| 奉化市|