官术网_书友最值得收藏!

Importing data in R

Data comes in various formats. Most of the data available online can be downloaded in the form of text documents (.txt extension) or as comma-separated values (.csv). We also encounter data in the tab-delimited format, XLS, HTML, JSON, XML, and so on. If you are interested in working with data, either in JSON or XML, refer to the recipe Constructing a bar plot using XML in R in Chapter 10, Creating Applications in R.

How to do it...

In order to import a CSV file in R, we can use the read.csv() function:

test = read.csv("raw.csv", sep = ",", header = TRUE)

Alternatively, read.table() function allows us to import data with different separators and formats. Following are some of the methods used to import data in R:

How to do it...

How it works…

The first argument in the read.csv() function is the filename, followed by the separator used in the file. The header = TRUE argument is used to instruct R that the file contains headers. Please note that R will search for this file in its current directory. We have to specify the directory containing the file using the setwd() function. Alternatively, we can navigate and set our working directory by navigating to Sessions | Set working directory | Choose directory.

The first argument in the read.table() function is the filename that contains the data, the second argument states that the data contains the header, and the third argument is related to the separator. If our data consists of a semi colon (;), a tab delimited, or the @ symbol as a separator, we can specify this under the sep ="" argument. Note that, to specify a separator as a tab delimited, users would have to substitute sep = "," with sep ="\t" in the read.table() function.

One of the other useful arguments is the row.names argument. If we omit row.names, R will use the column serial numbers as row.names. We can assign row.names for our data by specifying it as row.names = c("Name").

主站蜘蛛池模板: 永新县| 类乌齐县| 乌拉特前旗| 和政县| 广灵县| 吴堡县| 延川县| 景东| 渭源县| 赫章县| 泰顺县| 伊宁县| 滁州市| 康乐县| 新巴尔虎右旗| 宁远县| 泰来县| 东至县| 卢氏县| 宜章县| 米泉市| 峨山| 高雄市| 邓州市| 太原市| 定兴县| 西吉县| 永川市| 徐汇区| 新郑市| 西峡县| 柳江县| 罗平县| 祁阳县| 治多县| 丰镇市| 洛南县| 吴堡县| 盐津县| 清流县| 昆明市|