- Python 3 Text Processing with NLTK 3 Cookbook
- Jacob Perkins
- 211字
- 2021-09-03 09:45:34
Conventions
In this book, you will find a number of styles of text that distinguish between different kinds of information. Here are some examples of these styles, and an explanation of their meaning.
Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: "The sent_tokenize
function uses an instance of PunktSentenceTokenizer
from the nltk.tokenize.punkt
module."
A block of code is set as follows:
>>> from nltk.tokenize import sent_tokenize >>> sent_tokenize(para) ['Hello World.', "It's good to see you.", 'Thanks for buying this book.']
When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:
>>> doc.make_links_absolute('http://hello')
>>> abslinks = list(doc.iterlinks())
>>> (el, attr, link, pos) = abslinks[0]
>>> link
'http://hello/world'
Any command-line input or output is written as follows:
$ python train_chunker.py treebank_chunk
New terms and important words are shown in bold. Words that you see on the screen, in menus or dialog boxes for example, appear in the text like this: "Luckily, this will produce an exception with the message 'DictVectorizer' object has no attribute 'vocabulary_'".
Note
Warnings or important notes appear in a box like this.
Tip
Tips and tricks appear like this.
- Java Web開發(fā)學(xué)習(xí)手冊(cè)
- 解構(gòu)產(chǎn)品經(jīng)理:互聯(lián)網(wǎng)產(chǎn)品策劃入門寶典
- LabVIEW2018中文版 虛擬儀器程序設(shè)計(jì)自學(xué)手冊(cè)
- 樂學(xué)Web編程:網(wǎng)站制作不神秘
- Building RESTful Python Web Services
- Python Data Structures and Algorithms
- Mastering Linux Security and Hardening
- 編寫高質(zhì)量代碼:改善Objective-C程序的61個(gè)建議
- Orchestrating Docker
- Practical Microservices
- iOS開發(fā)項(xiàng)目化入門教程
- Python Web自動(dòng)化測(cè)試設(shè)計(jì)與實(shí)現(xiàn)
- C語言程序設(shè)計(jì)
- Elasticsearch Blueprints
- 零基礎(chǔ)學(xué)SQL(升級(jí)版)