- Hands-On Python Natural Language Processing
- Aman Kedia Mayank Rasu
- 133字
- 2021-06-18 18:28:56
Building Your NLP Vocabulary
In the earlier chapters, you were introduced to why Natural Language Processing (NLP) is important especially in today's context, which was followed by a discussion on a few prerequisites and Python libraries that are highly beneficial for NLP tasks. In this chapter, we will take this discussion further and discuss some of the most concrete tasks involved in building a vocabulary for NLP tasks and preprocessing textual data in detail. We will start by learning what a vocabulary is and take the notion forward to actually build a vocabulary. We will do this by applying various methods on text data that are present in most of the NLP pipelines across any organization.
In this chapter, we'll cover the following topics:
- Lexicons
- Phonemes, graphemes, and morphemes
- Tokenization
- Understanding word normalization
推薦閱讀
- Python Artificial Intelligence Projects for Beginners
- 大數據時代的數據挖掘
- ROS機器人編程與SLAM算法解析指南
- Hands-On Machine Learning with TensorFlow.js
- Python Data Science Essentials
- 自主研拋機器人技術
- 80x86/Pentium微型計算機原理及應用
- Ruby on Rails敏捷開發最佳實踐
- MATLAB/Simulink權威指南:開發環境、程序設計、系統仿真與案例實戰
- Mastering ServiceNow Scripting
- 過程控制系統
- 云計算和大數據的應用
- Data Analysis with R(Second Edition)
- Redash v5 Quick Start Guide
- Linux常用命令簡明手冊