官术网_书友最值得收藏!

Summary

In this chapter, we established the framework for the different data processing units that will be introduced in this book. There is a very good reason why the topics of model validation and overfitting are treated early on in this book: there is no point in building models and selecting algorithms if we do not have a methodology to evaluate their relative merits.

In this chapter, you were introduced to the following topics:

  • The concept of monadic transformation for implicit and explicit models
  • The versatility and cleanness of the cake pattern and mixin composition in Scala as an effective scaffolding tool for data processing
  • A robust methodology to validate machine learning models
  • The challenge in fitting models to both training and real-world data

The next chapter will address the problem of overfitting by identifying outliers and reducing noise in data.

主站蜘蛛池模板: 松潘县| 台前县| 铜山县| 个旧市| 兰西县| 荔浦县| 闻喜县| 台州市| 海口市| 连江县| 枣阳市| 嘉祥县| 卢氏县| 固镇县| 虹口区| 镇原县| 常熟市| 宁阳县| 陇川县| 分宜县| 南江县| 定州市| 淳化县| 万山特区| 诸暨市| 卢湾区| 佛坪县| 丹凤县| 云和县| 镇安县| 隆林| 民乐县| 鄂伦春自治旗| 隆尧县| 晋州市| 裕民县| 乌兰县| 汽车| 永年县| 张家川| 萝北县|