官术网_书友最值得收藏!

What this book covers

Chapter 1, Tokenizing Text and WordNet Basics, covers the basics of tokenizing text and using WordNet.

Chapter 2, Replacing and Correcting Words, discusses various word replacement and correction techniques. The recipes cover the gamut of linguistic compression, spelling correction, and text normalization.

Chapter 3, Text Classification, describes a way to categorize documents or pieces of text and, by examining the word usage in a piece of text, classifiers decide what class label should be assigned to it.

主站蜘蛛池模板: 吉林市| 明溪县| 洪湖市| 基隆市| 慈利县| 武宣县| 丹凤县| 西峡县| 砀山县| 和静县| 昌都县| 射阳县| 年辖:市辖区| 治多县| 衡阳县| 义马市| 岳阳县| 濮阳市| 阳山县| 大足县| 瓮安县| 镇远县| 保康县| 高安市| 四子王旗| 方山县| 清水河县| 耿马| 区。| 津市市| 七台河市| 邓州市| 太保市| 桂林市| 平湖市| 金门县| 深水埗区| 九龙坡区| 宜阳县| 金湖县| 客服|