- Hands-On Natural Language Processing with Python
- Rajesh Arumugam Rajalingappaa Shanmugamani
- 163字
- 2021-08-13 16:01:42
Text Classification and POS Tagging Using NLTK
The Natural Language Toolkit (NLTK) is a Python library for handling natural language processing (NLP) tasks, ranging from segmenting words or sentences to performing advanced tasks, such as parsing grammar and classifying text. NLTK provides several modules and interfaces to work on natural language, useful for tasks such as document topic identification, parts of speech (POS) tagging, sentiment analysis, and so on. For experimentation with various NLP tasks, NLTK also includes modules for a wide range of text corpora, from basic text collections to tagged and structured texts, such as WordNet. While the NLTK library provides a vast set of APIs, we will only cover the most important aspects that are commonly used in practical NLP applications.
We will cover the following topics in this chapter:
- Installing NLTK and its modules
- Text preprocessing and exploratory analysis
- Exploratory analysis of text
- POS tagging
- Training a sentiment classifier for movie reviews
- Training a bag-of-words classifier
- Boost程序庫(kù)完全開(kāi)發(fā)指南:深入C++”準(zhǔn)”標(biāo)準(zhǔn)庫(kù)(第5版)
- 潮流:UI設(shè)計(jì)必修課
- 跟老齊學(xué)Python:輕松入門(mén)
- Python測(cè)試開(kāi)發(fā)入門(mén)與實(shí)踐
- Python GUI Programming Cookbook
- Unreal Engine 4 Shaders and Effects Cookbook
- Visual Basic程序設(shè)計(jì)習(xí)題與上機(jī)實(shí)踐
- Kotlin進(jìn)階實(shí)戰(zhàn)
- Android智能手機(jī)APP界面設(shè)計(jì)實(shí)戰(zhàn)教程
- 高性能PHP 7
- 算法超簡(jiǎn)單:趣味游戲帶你輕松入門(mén)與實(shí)踐
- 前端架構(gòu)設(shè)計(jì)
- Serverless工程實(shí)踐:從入門(mén)到進(jìn)階
- Python自動(dòng)化開(kāi)發(fā)實(shí)戰(zhàn)
- Java程序設(shè)計(jì)