官术网_书友最值得收藏!

Summary

In this chapter, we looked at the various steps that are needed to build a natural language vocabulary. These play the most critical role in preprocessing any natural language data. Data preprocessing is probably one of the most important aspects of any machine learning application, and the same applies to NLP as well. When performed properly, these steps help with the machine learning aspects that generally occur after preprocessing the data, consequently providing better results most of the time compared with scenarios where no preprocessing is involved.

In the next chapter, we will use the techniques discussed in this chapter to preprocess data and subsequently build mathematical representations of text that can be understood by machine learning algorithms.

主站蜘蛛池模板: 古交市| 富平县| 青河县| 龙南县| 田东县| 营山县| 罗江县| 汉寿县| 大余县| 彭水| 澜沧| 高邑县| 商南县| 宜昌市| 辉县市| 格尔木市| 皋兰县| 三台县| 凌源市| 华亭县| 丹寨县| 内乡县| 永清县| 永昌县| 黔南| 潢川县| 偃师市| 太湖县| 灵寿县| 五莲县| 高雄市| 历史| 万安县| 普安县| 伊通| 台山市| 乐都县| 炉霍县| 大港区| 罗山县| 固原市|