- Hands-On Natural Language Processing with Python
- Rajesh Arumugam Rajalingappaa Shanmugamani
- 143字
- 2021-08-13 16:01:47
One-hot encoding
One-hot encoding is a vectorization technique for labeled data, especially categorical data. In the case of binary labels, target variables will be presented as [0, 1], [1, 0]. The same representation for three classes will appear as [0, 0, 1], [0, 1, 0], [1, 0, 0]. This type of representation can support any number of categories. The main advantage of one-hot encoding is that it treats all categorical data equally, in contrast to arbitrary categorical labels. For instance, categories to represent colors such as red, green, and blue, may use integers such as 0, 1, and 2. Although there is no intrinsic order for colors, some ML models may treat such input as if it has an order. This is avoided in one-hot encoding, as it does not assume any order in the categorical values since they are binary encoded.
- Advanced Machine Learning with Python
- 從零構(gòu)建知識圖譜:技術(shù)、方法與案例
- Kubernetes實戰(zhàn)
- 深入淺出Electron:原理、工程與實踐
- PostgreSQL for Data Architects
- Architecting the Industrial Internet
- RTC程序設(shè)計:實時音視頻權(quán)威指南
- 程序員考試案例梳理、真題透解與強化訓(xùn)練
- JavaScript 程序設(shè)計案例教程
- 全棧自動化測試實戰(zhàn):基于TestNG、HttpClient、Selenium和Appium
- 深入解析Java編譯器:源碼剖析與實例詳解
- DB2SQL性能調(diào)優(yōu)秘笈
- 深入理解Kafka:核心設(shè)計與實踐原理
- Python數(shù)據(jù)科學(xué)實踐指南
- 讀故事學(xué)編程:Python王國歷險記