官术网_书友最值得收藏!

Recognizing named entities

NER is a type of text annotation task. In NER, words or tokens in a piece of text are labeled or annotated into categories, such as organizations, locations, people, and so on. In effect, NER converts unstructured text data into structured data that can later be used for further analysis. The following screenshot is a visualization from the Google Cloud API. The reader can try out the API with the link provided in the preceding subsection:

The output result in the preceding screenshot shows how the different entities, such as ORGANISATION (Google), PERSON (Sundar Pitchai), EVENT (CONSUMER ELECTRONICS SHOW), and so on, are automatically extracted from the unstructured raw text by NER. The output also gives the sentiment for each label or category, based on sentiment analysis. The reader can experiment with different text using the link provided earlier. When we click on the Categories tab, we can see the following:

The preceding screenshot shows how the system also classifies a particular piece of text into Computer & Electronics, News, and so on, using the recognized named entities in the text. Such a categorization, called topic modeling, is another important NLP task, used to identify the main theme or topic of a sentence or document.

主站蜘蛛池模板: 三原县| 阳西县| 儋州市| 芜湖市| 葫芦岛市| 永修县| 临沂市| 沙洋县| 寿宁县| 朝阳市| 乌兰县| 长武县| 高密市| 石楼县| 维西| 云安县| 鹤山市| 三门县| 故城县| 休宁县| 仁布县| 新巴尔虎左旗| 东乌| 新郑市| 石嘴山市| 当阳市| 怀来县| 沈丘县| 喜德县| 稷山县| 休宁县| 绍兴市| 远安县| 义马市| 昔阳县| 玉环县| 张家港市| 三门峡市| 丰都县| 牟定县| 商洛市|