- Haskell Data Analysis Cookbook
- Nishant Shukla
- 170字
- 2021-12-08 12:43:34
Introduction

The conclusions drawn from data analysis are only as robust as the quality of the data itself. After obtaining raw text, the next natural step is to validate and clean it carefully. Even the slightest bias may risk the integrity of the results. Therefore, we must take great precautionary measures, which involve thorough inspection, to ensure sanity checks are performed on our data before we begin to understand it. This section should be the starting point for cleaning data in Haskell.
Real-world data often has an impurity that needs to be addressed before it can be processed. For example, extraneous whitespaces or punctuation could clutter data, making it difficult to parse. Duplication and data conflicts are another area of unintended consequences of reading real-world data. Sometimes it's just reassuring to know that data makes sense by conducting sanity checks. Some examples of sanity checks include matching regular expressions as well as detecting outliers by establishing a measure of distance. In this chapter, we will cover each of these topics.
- 微服務設計(第2版)
- Visual Studio 2012 Cookbook
- 高效微控制器C語言編程
- 前端跨界開發指南:JavaScript工具庫原理解析與實戰
- Learning Informatica PowerCenter 10.x(Second Edition)
- 老“碼”識途
- Apache Kafka Quick Start Guide
- 劍指大數據:企業級數據倉庫項目實戰(在線教育版)
- Babylon.js Essentials
- Buildbox 2.x Game Development
- Magento 2 Beginners Guide
- 交互式程序設計(第2版)
- Mastering Concurrency in Python
- Drupal 8 Development:Beginner's Guide(Second Edition)
- Java EE架構設計與開發實踐