官术网_书友最值得收藏!

Summary

The chapter began with an introduction to some of the most important datasets that will be used in the rest of the book. The datasets covered a range of analytical problems including classification, regression, time series, survival, clustering, and a dataset in which identifying an outlier is important. Important families of classification models were then introduced in the statistical/machine learning models section. Following the introduction of a variety of models, we immediately saw the shortcoming, in that we don't have a model for all seasons. Model performance varies from dataset to dataset. Depending on the initialization, the performance of certain models (such as neural networks) is affected. Consequently, there is a need to find a way to ensure that the models can be improved upon in most scenarios.

This paves the way for the ensemble method, which forms the title of this book. We will elaborate on this method in the rest of the book. This chapter closed with quick statistical tests that will help in carrying out model comparisons. Resampling forms the core of ensemble methods, and we will look at the important jackknife and bootstrap methods in the next chapter.

主站蜘蛛池模板: 论坛| 鄢陵县| 新河县| 滁州市| 昆山市| 仙居县| 望谟县| 集贤县| 钟山县| 凤山市| 裕民县| 顺平县| 上饶县| 惠来县| 曲靖市| 乐山市| 贵阳市| 松潘县| 肇庆市| 大厂| 日土县| 大邑县| 武义县| 新源县| 许昌市| 化隆| 平原县| 城固县| 天峨县| 台安县| 兴仁县| 胶州市| 苏尼特左旗| 建德市| 北安市| 江达县| 来宾市| 阿拉尔市| 哈巴河县| 舞钢市| 金阳县|