- Predictive Analytics Using Rattle and Qlik Sense
- Ferran Garcia Pagans
- 2021-07-16 13:40:17
Chapter 2. Preparing Your Data
The French term mise en place is used in professional kitchens to describe the practice of chefs organizing and arranging their ingredients up to the point where they are ready to be used. It may be as simple as washing and picking herbs into individual leaves or chopping vegetables, or as complicated as caramelizing onions or slow cooking meats.
In the same way, before we start cooking the data or building a predictive model, we need to prepare the ingredients, that is, the data. Our preparation covers three different tasks:
- Loading the data into the analytic tool
- Exploring the data to understand it and to find quality problems with it
- Transforming the data to fix the quality problems
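The three tasks above can be sketched in a few lines of code. This is only an illustration under stated assumptions: the book performs these steps in Rattle, whereas the sketch uses plain Python; the sample data, the column names, and the helper names `load`, `explore`, and `transform` are hypothetical; and replacing missing values with the column mean is just one possible fix, chosen here for brevity.

```python
import csv
import io
import statistics

# Hypothetical sample data: column names and values are invented for illustration.
RAW_CSV = """age,income
34,52000
,61000
45,
29,48000
"""


def load(text):
    """Load: parse CSV text into a list of dict rows (all values are strings)."""
    return list(csv.DictReader(io.StringIO(text)))


def explore(rows):
    """Explore: count missing (empty) values in each column."""
    return {col: sum(1 for r in rows if r[col] == "") for col in rows[0]}


def transform(rows):
    """Transform: convert values to float and fill gaps with the column mean.

    Mean imputation is an assumption made for this sketch, not a recommendation.
    """
    means = {}
    for col in rows[0]:
        present = [float(r[col]) for r in rows if r[col] != ""]
        means[col] = statistics.mean(present)
    return [
        {col: float(r[col]) if r[col] != "" else means[col] for col in r}
        for r in rows
    ]


rows = load(RAW_CSV)
missing = explore(rows)   # number of empty cells per column
clean = transform(rows)   # numeric rows with the gaps filled in
print(missing)
print(clean[1])
```

Here `explore` reveals the quality problem (one missing value in each column) and `transform` fixes it, which mirrors the explore-then-transform loop described in this chapter.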
We say that the quality of data is high when it's appropriate for a specific use. In this chapter, we'll describe characteristics of data related to its quality.
As we've seen, our mise en place has three steps. After loading the data, we need to explore it and transform it. Exploring and transforming is an iterative process, but in this book, we'll divide it into two different steps for clarity.
In this chapter, we'll discuss the following topics:
- Datasets and types of variables
- Data quality
- Loading data into Rattle
- Assigning roles to the variables
- Transforming variables to solve data quality problems and to improve the data format for our predictive model
In this chapter, we'll cover how we explore the data to understand it and find quality problems.