官术网_书友最值得收藏!

Analyzing Different Datasets

We all love to talk about the weather. So, let's work with some weather-related datasets. The datasets contain approximately five years' worth of high-temporal resolution (hourly measurements) data for various weather attributes, such as temperature, humidity, air pressure, and so on. We'll analyze and compare the humidity and weather datasets.

Let's begin by implementing the following steps:

  1. Load the humidity dataset by using the following command:
df_hum <- read.csv("data/historical-hourly-weather-data/humidity.csv")
  1. Load the weather description dataset by using the following command:
df_desc <- read.csv("data/historical-hourly-weather-data/weather_description.csv")
  1. Compare the two datasets by using the str command.

The outcome will be the humidity levels of different cities, as follows:

The weather descriptions of different cities are shown as follows:

The different geometric objects that we will be working with in this chapter are as follows:

One-dimensional objects are used to understand and visualize the characteristics of a single variable, as follows:

  • Histogram
  • Bar chart

Two-dimensional objects are used to visualize the relationship between two variables, as follows:

  • Bar chart
  • Boxplot
  • Line chart
  • Scatter plot

Although geometric objects are also used in base R, they don't follow the structure of the Grammar of Graphics and have different naming conventions, as compared to ggplot2. This is an important distinction, which we will look at in detail later.

主站蜘蛛池模板: 红原县| 南丹县| 古浪县| 长海县| 和林格尔县| 望江县| 宾川县| 孝昌县| 和硕县| 扬中市| 台东县| 安溪县| 南华县| 溧水县| 临沧市| 永年县| 岱山县| 湖口县| 江安县| 屯昌县| 来宾市| 静乐县| 当涂县| 上虞市| 高青县| 毕节市| 伊宁市| 泾阳县| 永福县| 墨竹工卡县| 陇南市| 青海省| 潮州市| 九江县| 库尔勒市| 青神县| 巴楚县| 黄石市| 苏州市| 怀集县| 通化市|