官术网_书友最值得收藏!

Analyzing Different Datasets

We all love to talk about the weather. So, let's work with some weather-related datasets. The datasets contain approximately five years' worth of high-temporal resolution (hourly measurements) data for various weather attributes, such as temperature, humidity, air pressure, and so on. We'll analyze and compare the humidity and weather datasets.

Let's begin by implementing the following steps:

  1. Load the humidity dataset by using the following command:
df_hum <- read.csv("data/historical-hourly-weather-data/humidity.csv")
  1. Load the weather description dataset by using the following command:
df_desc <- read.csv("data/historical-hourly-weather-data/weather_description.csv")
  1. Compare the two datasets by using the str command.

The outcome will be the humidity levels of different cities, as follows:

The weather descriptions of different cities are shown as follows:

The different geometric objects that we will be working with in this chapter are as follows:

One-dimensional objects are used to understand and visualize the characteristics of a single variable, as follows:

  • Histogram
  • Bar chart

Two-dimensional objects are used to visualize the relationship between two variables, as follows:

  • Bar chart
  • Boxplot
  • Line chart
  • Scatter plot

Although geometric objects are also used in base R, they don't follow the structure of the Grammar of Graphics and have different naming conventions, as compared to ggplot2. This is an important distinction, which we will look at in detail later.

主站蜘蛛池模板: 京山县| 鄂托克前旗| 岫岩| 社会| 泸州市| 龙州县| 兴文县| 泾阳县| 安义县| 台南县| 定南县| 浦东新区| 霸州市| 秦安县| 凤山市| 奇台县| 金山区| 中江县| 青川县| 阿拉善右旗| 肥西县| 墨脱县| 中卫市| 金门县| 鱼台县| 昭苏县| 涡阳县| 息烽县| 孝昌县| 临清市| 荃湾区| 马关县| 应用必备| 宁波市| 锡林郭勒盟| 龙胜| 潜山县| 吉木乃县| 西充县| 灵璧县| 玉树县|