官术网_书友最值得收藏!

Summary

In this chapter, we looked at the various steps that are needed to build a natural language vocabulary. These play the most critical role in preprocessing any natural language data. Data preprocessing is probably one of the most important aspects of any machine learning application, and the same applies to NLP as well. When performed properly, these steps help with the machine learning aspects that generally occur after preprocessing the data, consequently providing better results most of the time compared with scenarios where no preprocessing is involved.

In the next chapter, we will use the techniques discussed in this chapter to preprocess data and subsequently build mathematical representations of text that can be understood by machine learning algorithms.

主站蜘蛛池模板: 三原县| 文昌市| 金坛市| 沂源县| 隆林| 建昌县| 屯昌县| 宁晋县| 清远市| 礼泉县| 扎鲁特旗| 西安市| 资中县| 高台县| 永昌县| 耒阳市| 桃江县| 沂水县| 湖南省| 商丘市| 崇信县| 彰化市| 长海县| 扎赉特旗| 安图县| 潞城市| 玛曲县| 郴州市| 龙江县| 安泽县| 建宁县| 正镶白旗| 莎车县| 桐柏县| 平阳县| 普洱| 务川| 资中县| 四子王旗| 临汾市| 湖北省|