
Summary

In this chapter, we established the framework for the different data processing units that will be introduced in this book. Model validation and overfitting are treated this early for a good reason: there is no point in building models and selecting algorithms without a methodology to evaluate their relative merits.

You were introduced to the following topics:

  • The concept of monadic transformation for implicit and explicit models
  • The versatility and cleanness of the cake pattern and mixin composition in Scala as an effective scaffolding tool for data processing
  • A robust methodology to validate machine learning models
  • The challenge in fitting models to both training and real-world data

The next chapter will address the problem of overfitting by identifying outliers and reducing noise in data.
