Introduction

In the previous chapter, you learned about extraction methods such as tokenization, stemming, lemmatization, and stop-word removal, which are used to extract features from unstructured text. We also discussed Bag-of-Words and Term Frequency-Inverse Document Frequency (TF-IDF).

In this chapter, you will learn how to use these extracted features to develop machine learning models. These models can solve real-world problems such as detecting whether the sentiment of a text is positive or negative, or predicting whether an email is spam. We will also cover supervised and unsupervised learning, classification and regression, the sampling and splitting of data, and the in-depth evaluation of a model's performance. This chapter also discusses how to save these models and load them for future use.
