官术网_书友最值得收藏!

Paragraph

A paragraph is the largest unit of text handled by an NLP task. Paragraph level boundaries by itself may not be much use unless broken down into sentences. Though sometimes the paragraph may be considered as context boundaries. Tokenizers that can split a document into paragraphs are available in some of the Python libraries. We will look at such tokenizers in later chapters.

主站蜘蛛池模板: 抚顺市| 家居| 太保市| 姚安县| 卢龙县| 张家川| 大宁县| 新乡市| 阳朔县| 龙川县| 同江市| 依安县| 太湖县| 平顶山市| 广东省| 钟祥市| 嘉黎县| 光泽县| 彩票| 涟水县| 五峰| 英山县| 香港 | 灵宝市| 巴林右旗| 西和县| 水城县| 南部县| 积石山| 沁水县| 天等县| 台南县| 贡嘎县| 博湖县| 页游| 沙河市| 留坝县| 宝坻区| 平乡县| 皮山县| 江川县|