- Hands-On Natural Language Processing with Python
- Rajesh Arumugam, Rajalingappaa Shanmugamani
- 2021-08-13 16:01:45
Summary
In this chapter, we covered common NLP tasks, such as preprocessing and exploratory analysis of text, using the NLTK library. Real-world text is unstructured and needs extensive preprocessing, such as tokenization, stemming, and stop word removal, before it is suitable for ML. As the examples showed, NLTK provides an extensive API for these preprocessing steps. It ships with built-in packages and modules, and it is flexible enough to accommodate custom components, such as user-defined stemmers and tokenizers.
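As a quick recap, the three preprocessing steps named above can be chained into one small function. This is a minimal sketch, not the chapter's own code: it assumes NLTK is installed, uses `RegexpTokenizer` and `PorterStemmer` because neither needs a corpus download, and substitutes a tiny hand-picked stop word set (`STOP_WORDS`, a hypothetical name) for NLTK's full stop word corpus.

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import RegexpTokenizer

# Assumption for this sketch: a tiny hand-picked stop word set.
# NLTK's full English list requires downloading the "stopwords" corpus.
STOP_WORDS = {"the", "a", "an", "is", "are", "and", "of", "to", "in"}

tokenizer = RegexpTokenizer(r"\w+")  # keep runs of word characters, drop punctuation
stemmer = PorterStemmer()


def preprocess(text):
    """Tokenize, remove stop words, and stem a piece of raw text."""
    tokens = tokenizer.tokenize(text.lower())
    kept = [token for token in tokens if token not in STOP_WORDS]
    return [stemmer.stem(token) for token in kept]


print(preprocess("The cats are running"))
```

Keeping each step as a separate expression makes it easy to swap in a different tokenizer or a user-defined stemmer later.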
We also discussed using NLTK for POS tagging, another common NLP task, which supports problems such as word sense disambiguation and question answering. Applications such as sentiment classification are widely used for their research and business value. We covered some basic examples of text classification in the context of sentiment analysis of tweets and movie reviews, using the NLTK corpora and sklearn. While these techniques are adequate for simple NLP applications, more complex text classification using deep learning will be explained in subsequent chapters.
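The shape of a basic sentiment classifier can be sketched in a few lines of sklearn. This is an illustrative sketch under stated assumptions, not the chapter's implementation: the four training sentences and their labels are made up for the example (the chapter used NLTK corpora), and the bag-of-words plus Naive Bayes pairing is one simple choice among many. The `classify` helper is a hypothetical name.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Assumption: toy, hand-written training data for illustration only.
train_texts = [
    "great movie loved it",
    "wonderful acting great plot",
    "terrible movie hated it",
    "awful acting boring plot",
]
train_labels = ["pos", "pos", "neg", "neg"]

# Bag-of-words counts feed a multinomial Naive Bayes classifier.
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(train_texts, train_labels)


def classify(text):
    """Return the predicted sentiment label for a single text."""
    return model.predict([text])[0]


print(classify("great wonderful"))
print(classify("terrible boring"))
```

With real data, the texts would first pass through preprocessing such as tokenization and stop word removal, and the model would be evaluated on a held-out split rather than on hand-picked phrases.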