- Hands-On Exploratory Data Analysis with R
- Radhika Datar Harish Garg
- 189字
- 2021-06-24 14:10:38
Examining, cleaning, and filtering data
The next steps after importing the data are to examine it and check for missing or erroneous data. We then need to clean the data and apply filters and selections. Different kinds of datasets need different approaches to carry out these steps. R has powerful packages to handle this and some of them are as follows:
- dplyr: dplyr is a powerful R package that provides methods to make examining, cleaning, and filtering data fast and easy.
- tidyr: The tidyr package helps to organize messy data for easier data analysis.
- stringr: The stringr package provides methods and techniques of working with string data efficiently.
- forcats: Factors are widely used while doing data analysis in R. The forcats package makes it easy to work with factors.
- lubridate: lubridate makes wrangling date-time data quick and easy.
- hms: hms is a great package for handling datasets that include data with time of day values.
- blob: Not all data always comes stored in plain ASCII text; you sometimes have to deal with binary data formats. The blob package makes this easy.
推薦閱讀
- Microsoft Power BI Quick Start Guide
- R Data Mining
- Mastercam 2017數(shù)控加工自動(dòng)編程經(jīng)典實(shí)例(第4版)
- Circos Data Visualization How-to
- Maya 2012從入門到精通
- 永磁同步電動(dòng)機(jī)變頻調(diào)速系統(tǒng)及其控制(第2版)
- 21天學(xué)通Visual C++
- Learning C for Arduino
- INSTANT Drools Starter
- 傳感器與新聞
- 電腦上網(wǎng)入門
- Web編程基礎(chǔ)
- 典型Hadoop云計(jì)算
- 計(jì)算機(jī)應(yīng)用基礎(chǔ)實(shí)訓(xùn)·職業(yè)模塊
- 新一代人工智能與語(yǔ)音識(shí)別