- Data Analysis with R
- Tony Fischetti
- 2021-07-30 09:55:09
What this book covers
Chapter 1, RefresheR, reviews the aspects of R that subsequent chapters assume you already know. Here, we learn the basics of R syntax, learn R's major data structures, write functions, load data, and install packages.
Chapter 2, The Shape of Data, discusses univariate data. We learn about different data types, how to describe univariate data, and how to visualize the shape of these data.
Chapter 3, Describing Relationships, goes on to the subject of multivariate data. In particular, we learn about the three main classes of bivariate relationships and learn how to describe them.
Chapter 4, Probability, kicks off a new unit by laying a foundation. We learn about basic probability theory, Bayes' theorem, and probability distributions.
Chapter 5, Using Data to Reason About the World, discusses sampling and estimation theory. Through examples, we learn about the central limit theorem, point estimation, and confidence intervals.
Chapter 6, Testing Hypotheses, introduces the subject of Null Hypothesis Significance Testing (NHST). We learn many popular hypothesis tests and their non-parametric alternatives. Most importantly, we gain a thorough understanding of the misconceptions and gotchas of NHST.
Chapter 7, Bayesian Methods, introduces an alternative to NHST based on a more intuitive view of probability. We learn the advantages and drawbacks of this approach, too.
Chapter 8, Predicting Continuous Variables, thoroughly discusses linear regression. Before the chapter's conclusion, we learn all about the technique, when to use it, and what traps to look out for.
Chapter 9, Predicting Categorical Variables, introduces four of the most popular classification techniques. By using all four on the same examples, we gain an appreciation for what makes each technique shine.
Chapter 10, Sources of Data, is all about how to use different data sources in R. In particular, we learn how to interface with databases, and request and load JSON and XML via an engaging example.
Chapter 11, Dealing with Messy Data, introduces some of the snags of working with less-than-perfect data in practice. The bulk of this chapter is dedicated to missing data, imputation, and identifying and testing for messy data.
Chapter 12, Dealing with Large Data, discusses some of the techniques that can be used to cope with data sets that are too large to be handled swiftly without a little planning. The key topics of this chapter are parallelization and Rcpp.
Chapter 13, Reproducibility and Best Practices, closes with the extremely important (but often ignored) topic of how to use R like a professional. This includes learning about tooling, organization, and reproducibility.