- Machine Learning with Swift
- Alexander Sosnovshchenko
- 222字
- 2021-06-24 18:54:55
Converting categorical variables
As you already have noticed, a data frame can contain columns with the data of different types. To see which type has each column, we can check the dtypes attribute of the data frame. You can think about Python attributes as being similar to Swift properties:
In []: df.dtypes Out[]: length float64 color object fluffy bool label object dtype: object
While length and fluffy columns contain the expected datatypes, the types of color and label are less transparent. What are those objects? This means those columns can contain any type of the object. At the moment, we have strings in them, but what we really want them to be are categorical variables. In case you don't remember from the previous chapter, categorical variables are like Swift enums. Fortunately for us, data frame has handy methods for converting columns from one type to another:
In []: df.color = df.color.astype('category') df.label = df.label.astype('category')
That's it. Let's check:
In []: df.dtypes Out []: length float64 color category fluffy bool label category dtype: object
color and label are categories now. To see all colors in those categories, execute:
In []: colors = df.color.cat.categories.get_values().astype('string') colors Out[]: array(['light black', 'pink gold', 'purple polka-dot', 'space gray'], dtype='|S16')
As expected, we have four colors. '|S16' stands for strings of 16 characters in length.
- 觸摸屏實用技術與工程應用
- Instant uTorrent
- AMD FPGA設計優化寶典:面向Vivado/SystemVerilog
- The Deep Learning with Keras Workshop
- 分布式微服務架構:原理與實戰
- 微服務分布式架構基礎與實戰:基于Spring Boot + Spring Cloud
- 微軟互聯網信息服務(IIS)最佳實踐 (微軟技術開發者叢書)
- Arduino BLINK Blueprints
- Machine Learning with Go Quick Start Guide
- IP網絡視頻傳輸:技術、標準和應用
- 單片機項目設計教程
- Drupal Rules How-to
- The Applied Artificial Intelligence Workshop
- Hands-On One-shot Learning with Python
- DevOps實戰:VMware管理員運維方法、工具及最佳實踐