- Practical Data Wrangling
- Allan Visochek
- 2021-07-02 15:16:05
R
R is both a programming language and an environment built specifically for statistical computing. This definition is adapted from the R website, r-project.org/about.html.
In other words, one of the major differences between R and Python is that some of the most common functionalities for working with data (data handling and storage, visualization, statistical computation, and so on) come built into R. A good example of this is linear modeling, a basic statistical method for modeling numerical data.
In R, linear modeling is built in and is intuitive and straightforward to use, as we will see in Chapter 5, Manipulating Text Data - An Introduction to Regular Expressions. There are a number of ways to do linear modeling in Python, but they generally rely on external libraries and often require extra work to get the data into the right format.
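To make the contrast concrete, here is a minimal sketch of what fitting a straight line looks like in Python when no external library is used. The function name `fit_line` and the sample data are hypothetical, chosen only for illustration; the sketch covers ordinary least squares with a single predictor and nothing more.

```python
# Ordinary least squares for one predictor, written with built-in Python only.
# The data below is made up purely for illustration.

def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

hours = [1, 2, 3, 4, 5]          # hypothetical predictor
scores = [52, 54, 56, 58, 60]    # hypothetical response: exactly 50 + 2 * hours

slope, intercept = fit_line(hours, scores)
print(f"slope={slope:.2f}, intercept={intercept:.2f}")  # slope=2.00, intercept=50.00
```

In R, the same fit is a single call to the built-in `lm()` function, for example `lm(scores ~ hours)`.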
R also has a built-in data structure called a data frame, which can make manipulation of tabular data more intuitive.
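For comparison, the following sketch shows one way to hold and manipulate a small table using only Python's built-in types, here a list of row dictionaries. The column names and values are hypothetical, and this is just one of several possible representations.

```python
# A table held as a list of row dictionaries, using only built-in Python types.
# Column names and values are hypothetical.
rows = [
    {"city": "Hartford", "year": 2015, "permits": 120},
    {"city": "Hartford", "year": 2016, "permits": 135},
    {"city": "New Haven", "year": 2015, "permits": 98},
    {"city": "New Haven", "year": 2016, "permits": 110},
]

# Select one column
permits = [row["permits"] for row in rows]

# Filter rows by a condition
rows_2016 = [row for row in rows if row["year"] == 2016]

# Add a derived column to every row
for row in rows:
    row["permits_per_month"] = row["permits"] / 12

print(permits)                             # [120, 135, 98, 110]
print([row["city"] for row in rows_2016])  # ['Hartford', 'New Haven']
```

With an R data frame named `df`, the first two operations correspond to the expressions `df$permits` and `df[df$year == 2016, ]`.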
The big takeaway here is that there are benefits and trade-offs to both languages. In general, being able to use the right tool for the job can save an immense amount of time spent on data wrangling. It is therefore quite useful as a data programmer to have a good working knowledge of each language and know when to use one or the other.