- Scala for Machine Learning(Second Edition)
- Patrick R. Nicolas
- 147字
- 2021-07-08 10:43:04
Chapter 2. Data Pipelines
In the first chapter, you were acquainted with some rudimentary concepts regarding data processing, clustering, and classification.
This chapter is dedicated to the creation and maintenance of a flexible end-to-end workflow to train and classify data. The first section of the chapter introduces a data-centric (functional) approach to create number crunching applications, followed by a description of a configurable workflow computation model. The chapter concludes with an overview of different model validation techniques.
You will learn how to do the following:
- Apply the concept of monadic design to create dynamic workflows
- Leverage some of Scala's advanced patterns, such as the cake pattern, to build portable computational workflows
- Take into account the bias-variance trade-off in selecting a model
- Overcome overfitting in modeling
- Break down data into training, test and validation sets
- Implement model validation in Scala using precision, recall, and F score
推薦閱讀
- MongoDB for Java Developers
- Python測試開發(fā)入門與實踐
- 精通搜索分析
- 鋒利的SQL(第2版)
- Android Native Development Kit Cookbook
- Python編程:從入門到實踐
- Mastering ROS for Robotics Programming
- Python High Performance Programming
- 學(xué)習(xí)OpenCV 4:基于Python的算法實戰(zhàn)
- Android傳感器開發(fā)與智能設(shè)備案例實戰(zhàn)
- Android應(yīng)用開發(fā)實戰(zhàn)
- Natural Language Processing with Python Quick Start Guide
- 深入淺出Python數(shù)據(jù)分析
- WebStorm Essentials
- JBoss AS 7 Development