- Big Data Analytics with R
- Simon Walkowiak
- 225字
- 2021-07-14 10:42:43
Chapter 2. Introduction to R Programming Language and Statistical Environment
In Chapter 1, The era of "Big Data", you have become familiar with the most useful Big Data terminology, and a small selection of typical tools applied to unusually large or complex data sets. You have also gained essential insights into how R was developed and how it became the leading statistical computing environment and programming language favored by technology giants and the best universities in the world. In this chapter you will have the opportunity to learn some most important R functions from base R installation and well-known third party packages used for data crunching, transformation, and analysis. More specifically in this chapter you will:
- Understand the landscape of available R data structures
- Be guided through a number of R operations allowing you to import data from standard and proprietary data formats
- Carry out essential data cleaning and processing activities such as subsetting, aggregating, creating contingency tables, and so on
- Inspect the data by implementing a selection of Exploratory Data Analysis techniques such as descriptive statistics
- Apply basic statistical methods to estimate correlation parameters between two (Pearson's r) or more variables (multiple regressions) or find the differences between means for two (t-tests) or more groups Analysis of Variance (ANOVA)
- Be introduced to more advanced data modeling tasks like logistic and Poisson regressions
推薦閱讀
- Docker進(jìn)階與實(shí)戰(zhàn)
- Leap Motion Development Essentials
- Python深度學(xué)習(xí)
- 匯編語言程序設(shè)計(jì)(第2版)
- Building a Recommendation Engine with Scala
- MySQL數(shù)據(jù)庫(kù)基礎(chǔ)實(shí)例教程(微課版)
- Reactive Android Programming
- C#實(shí)踐教程(第2版)
- Learning Laravel's Eloquent
- Java語言程序設(shè)計(jì)教程
- Building Wireless Sensor Networks Using Arduino
- MySQL程序員面試筆試寶典
- R Data Science Essentials
- Java編程從入門到精通
- Django Design Patterns and Best Practices