What this book covers

Chapter 1, Tokenizing Text and WordNet Basics, covers the basics of tokenizing text and using WordNet.

Chapter 2, Replacing and Correcting Words, discusses various word replacement and correction techniques. The recipes cover the gamut of linguistic compression, spelling correction, and text normalization.

Chapter 3, Text Classification, describes a way to categorize documents or pieces of text. By examining the word usage in a piece of text, classifiers decide what class label should be assigned to it.
