- Natural Language Processing Fundamentals
- Sohom Ghosh, Dwight Gunning
- 2021-06-11 13:42:29
Introduction
In the previous chapter, we learned about the concepts of Natural Language Processing (NLP) and text analytics, and briefly looked at various pre-processing steps. In this chapter, we will learn how to deal with text data, most of which is unstructured. Unstructured data does not fit naturally into a tabular format, so it must be converted into numeric features, because most machine learning algorithms can work only with numbers. We will place more emphasis on steps such as tokenization, stemming, lemmatization, and stop-word removal. You will also learn about two popular methods for feature extraction, bag of words and Term Frequency-Inverse Document Frequency (TF-IDF), as well as various methods for creating new features from existing ones. Finally, you will become familiar with how text data can be visualized.
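To make the idea of turning raw text into numeric features concrete, here is a minimal sketch in plain Python. It is not the chapter's own implementation: the two sample sentences, the whitespace tokenizer, and the unsmoothed IDF formula log(N / df) are all assumptions chosen for illustration, and libraries commonly use smoothed or normalized variants.

```python
import math
from collections import Counter

# Hypothetical toy corpus, used only for illustration.
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
]

def tokenize(text):
    """Naive tokenizer (assumption): lowercase, then split on whitespace."""
    return text.lower().split()

tokenized = [tokenize(d) for d in docs]

# Vocabulary: every distinct token in the corpus, in sorted order.
vocab = sorted({tok for doc in tokenized for tok in doc})

def bag_of_words(tokens):
    """Count how often each vocabulary term occurs in one document."""
    counts = Counter(tokens)
    return [counts[term] for term in vocab]

bow = [bag_of_words(doc) for doc in tokenized]

def idf(term):
    """Unsmoothed inverse document frequency: log(N / df)."""
    df = sum(1 for doc in tokenized if term in doc)
    return math.log(len(tokenized) / df)

# TF-IDF: raw term count weighted by how rare the term is across documents.
tfidf = [[tf * idf(term) for tf, term in zip(row, vocab)] for row in bow]

print(vocab)
print(bow)
print(tfidf)
```

Each document becomes a fixed-length vector of numbers. Under this unsmoothed formula, terms that occur in every document, such as "the", receive a TF-IDF weight of zero, while terms unique to one document keep a positive weight.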