官术网_书友最值得收藏!

Introduction

In the previous chapter, you learned about various extraction methods, such as tokenization, stemming, lemmatization, and stop-word removal, which are used to extract features from unstructured text. We also discussed Bag-of-Words and Term Frequency-Inverse Document Frequency (TF-IDF).

In this chapter, you will learn how to use these extracted features to develop machine learning models. These models are capable of solving real-world problems such as detecting whether sentiments carried by texts are positive or negative, predicting whether emails are spam or not, and so on. We will also cover concepts such as supervised and unsupervised learning, classifications and regressions, the sampling and splitting of data, along with evaluating the performance of a model in depth. This chapter also discusses how to load and save these models for future use.

主站蜘蛛池模板: 夏邑县| 虎林市| 双辽市| 且末县| 新竹市| 寿光市| 耒阳市| 汉川市| 睢宁县| 白银市| 乐业县| 莱阳市| 玉树县| 兴义市| 南安市| 庆城县| 卫辉市| 吴桥县| 桃源县| 酉阳| 乌恰县| 芮城县| 瑞金市| 新龙县| 通道| 宁德市| 肥东县| 紫金县| 灵山县| 自贡市| 资中县| 兴义市| 韶山市| 北安市| 庆安县| 永川市| 三台县| 咸丰县| 德兴市| 大安市| 彭阳县|