官术网_书友最值得收藏!

Text processing

It is possible to do simple text processing using only the standard Java library with classes such as StringTokenizer, the java.text package, or the regular expressions.

In addition to that, there is a big variety of text processing frameworks available for Java as follows:

Most NLP libraries have very similar functionality and coverage of algorithms, which is why selecting which one to use is usually a matter of habit or taste. They all typically have tokenization, parsing, part-of-speech tagging, named entity recognition, and other algorithms for text processing. Some of them (such as StanfordNLP) support multiple languages, and some support only English.

We will cover some of these libraries in Chapter 6Working with Text - Natural Language Processing and Information Retrival.

主站蜘蛛池模板: 双峰县| 承德市| 宁海县| 当涂县| 曲水县| 历史| 芮城县| 清水河县| 松滋市| 津市市| 开封县| 明光市| 东方市| 内丘县| 积石山| 长宁县| 黎川县| 宁阳县| 栾川县| 扎鲁特旗| 涿鹿县| 石阡县| 芜湖市| 奎屯市| 文登市| 台北市| 朝阳县| 措勤县| 铜鼓县| 广南县| 汪清县| 齐河县| 柞水县| 莲花县| 罗甸县| 石柱| 东辽县| 凤山县| 精河县| 旅游| 萝北县|