- Mastering Machine Learning with Spark 2.x
- Alex Tellez Max Pumperla Michal Malohlava
- 237字
- 2021-07-02 18:46:06
Data science - an iterative process
Often, the process flow of many big data projects is iterative, which means a lot of back-and-forth testing new ideas, new features to include, tweaking various hyper-parameters, and so on, with a fail fast attitude. The end result of these projects is usually a model that can answer a question being posed. Notice that we didn't say accurately answer a question being posed! One pitfall of many data scientists these days is their inability to generalize a model for new data, meaning that they have overfit their data so that the model provides poor results when given new data. Accuracy is extremely task-dependent and is usually dictated by the business needs with some sensitivity analysis being done to weigh the cost-benefits of the model outcomes. However, there are a few standard accuracy measures that we will go over throughout this book so that you can compare various models to see how changes to the model impact the result.
- MySQL數據庫管理實戰
- Android Development with Kotlin
- 跟老齊學Python:輕松入門
- INSTANT Sencha Touch
- Java深入解析:透析Java本質的36個話題
- Raspberry Pi 2 Server Essentials
- Xamarin.Forms Projects
- Python漫游數學王國:高等數學、線性代數、數理統計及運籌學
- Linux命令行與shell腳本編程大全(第4版)
- Learning Unity 2D Game Development by Example
- Learning Continuous Integration with TeamCity
- App Inventor創意趣味編程進階
- Natural Language Processing with Python Quick Start Guide
- ASP.NET求職寶典
- Mastering Apache Camel