
Paragraph

A paragraph is the largest unit of text handled by an NLP task. Paragraph-level boundaries are not of much use by themselves unless the paragraphs are broken down further into sentences, although a paragraph is sometimes treated as a context boundary. Tokenizers that can split a document into paragraphs are available in some Python libraries. We will look at such tokenizers in later chapters.
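To make the idea of a paragraph boundary concrete, here is a minimal sketch that uses only the Python standard library. It assumes that paragraphs are separated by one or more blank lines, which is a common convention but not a universal one. The function name `split_paragraphs` and the sample text are illustrative only and do not come from any particular library.

```python
import re


def split_paragraphs(text):
    """Split text into paragraphs, assuming blank lines separate them."""
    # A newline, optional whitespace, then another newline marks a boundary.
    parts = re.split(r"\n\s*\n", text.strip())
    # Trim stray whitespace and drop any empty pieces.
    return [p.strip() for p in parts if p.strip()]


sample = (
    "First paragraph. It has two sentences.\n"
    "\n"
    "Second paragraph.\n"
    " \n"
    "Third one."
)

for number, paragraph in enumerate(split_paragraphs(sample), start=1):
    print(number, paragraph)
```

Each returned paragraph could then be passed to a sentence tokenizer, or kept whole when the paragraph itself is used as the context boundary.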
