- Haskell Data Analysis Cookbook
- Nishant Shukla
- 170字
- 2021-12-08 12:43:34
Introduction

The conclusions drawn from data analysis are only as robust as the quality of the data itself. After obtaining raw text, the next natural step is to validate and clean it carefully. Even the slightest bias may risk the integrity of the results. Therefore, we must take great precautionary measures, which involve thorough inspection, to ensure sanity checks are performed on our data before we begin to understand it. This section should be the starting point for cleaning data in Haskell.
Real-world data often has an impurity that needs to be addressed before it can be processed. For example, extraneous whitespaces or punctuation could clutter data, making it difficult to parse. Duplication and data conflicts are another area of unintended consequences of reading real-world data. Sometimes it's just reassuring to know that data makes sense by conducting sanity checks. Some examples of sanity checks include matching regular expressions as well as detecting outliers by establishing a measure of distance. In this chapter, we will cover each of these topics.
- Spring 5企業級開發實戰
- TensorFlow Lite移動端深度學習
- Java系統分析與架構設計
- WebAssembly實戰
- Java完全自學教程
- Manga Studio Ex 5 Cookbook
- 認識編程:以Python語言講透編程的本質
- Spring Boot+Spring Cloud+Vue+Element項目實戰:手把手教你開發權限管理系統
- RTC程序設計:實時音視頻權威指南
- Java開發入行真功夫
- Julia Cookbook
- INSTANT Yii 1.1 Application Development Starter
- Django 3.0應用開發詳解
- 深入實踐Kotlin元編程
- Python大學實用教程