- Hands-On Natural Language Processing with Python
- Rajesh Arumugam, Rajalingappaa Shanmugamani
- 167 words
- 2021-08-13 16:01:45
Summary
In this chapter, we covered common NLP tasks, such as preprocessing and exploratory analysis of text, using the NLTK library. Real-world text is unstructured and needs extensive preprocessing, such as tokenization, stemming, and stop word removal, before it is suitable for ML. As the examples showed, NLTK provides an extensive API for these preprocessing steps. It ships with built-in packages and modules, and it is flexible enough to support custom components, such as user-defined stemmers and tokenizers.
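As a reminder of how these preprocessing steps fit together, here is a minimal sketch rather than the chapter's own code. It uses NLTK's `RegexpTokenizer` and `PorterStemmer`, which need no downloaded data. The `preprocess` function and the tiny `STOP_WORDS` set are hypothetical names introduced for illustration; in practice you would more likely use NLTK's own stop word list from `nltk.corpus.stopwords`, which requires a one-time `nltk.download("stopwords")`.

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import RegexpTokenizer

# Hypothetical, deliberately tiny stop word list used only for this sketch.
STOP_WORDS = {"the", "a", "an", "is", "are", "of", "and", "to", "in"}


def preprocess(text):
    """Lowercase and tokenize the text, drop stop words, then stem each token."""
    tokenizer = RegexpTokenizer(r"\w+")  # keep runs of word characters only
    stemmer = PorterStemmer()
    tokens = tokenizer.tokenize(text.lower())
    return [stemmer.stem(tok) for tok in tokens if tok not in STOP_WORDS]


print(preprocess("The cats are running in the gardens"))
# → ['cat', 'run', 'garden']
```

Swapping in a different tokenizer or stemmer only changes the two objects created inside `preprocess`, which is the kind of flexibility the summary refers to.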
We also discussed using NLTK for POS tagging, another common NLP task, which supports applications such as word sense disambiguation and question answering. Applications such as sentiment classification are widely used for their research and business value. We covered basic examples of text classification for sentiment analysis of tweets and movie reviews, using the NLTK corpora and sklearn. These approaches are sufficient for simple NLP applications; more complex text classification using deep learning is explained in subsequent chapters.
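The sentiment classification idea can be sketched in a few lines of sklearn. This is an illustrative example, not the chapter's code: the six training sentences are invented for the sketch and stand in for the NLTK tweet and movie review corpora, and the bag-of-words plus naive Bayes pipeline is one reasonable choice among several.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Made-up toy data; a real experiment would load a labeled corpus instead.
train_texts = [
    "I loved this movie, great acting",
    "what a wonderful, great film",
    "fantastic story and wonderful cast",
    "I hated this movie, terrible acting",
    "what a boring, awful film",
    "terrible story and awful cast",
]
train_labels = ["pos", "pos", "pos", "neg", "neg", "neg"]

# Bag-of-words counts feed a multinomial naive Bayes classifier.
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(train_texts, train_labels)

print(model.predict(["great wonderful film", "terrible awful film"]))
```

On data this small the classifier simply learns which words appear with which label; with a real corpus you would hold out a test set and measure accuracy before trusting the predictions.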