官术网_书友最值得收藏!

Text processing

It is possible to do simple text processing using only the standard Java library with classes such as StringTokenizer, the java.text package, or the regular expressions.

In addition to that, there is a big variety of text processing frameworks available for Java as follows:

Most NLP libraries have very similar functionality and coverage of algorithms, which is why selecting which one to use is usually a matter of habit or taste. They all typically have tokenization, parsing, part-of-speech tagging, named entity recognition, and other algorithms for text processing. Some of them (such as StanfordNLP) support multiple languages, and some support only English.

We will cover some of these libraries in Chapter 6Working with Text - Natural Language Processing and Information Retrival.

主站蜘蛛池模板: 绥芬河市| 龙南县| 鸡东县| 泌阳县| 胶州市| 加查县| 定安县| 肇庆市| 福清市| 唐山市| 独山县| 桐柏县| 新密市| 泗阳县| 祁连县| 雷州市| 鸡西市| 湖北省| 吕梁市| 杭锦旗| 贵定县| 奉化市| 墨玉县| 交口县| 土默特右旗| 丹棱县| 宝坻区| 信阳市| 习水县| 清苑县| 牙克石市| 常德市| 巩义市| 博白县| 滦平县| 屏东市| 海兴县| 札达县| 济阳县| 新邵县| 桐乡市|