- Effective Amazon Machine Learning
- Alexis Perrier
- 287字
- 2021-07-03 00:17:50
What's an algorithm? What's a model?
Before we pe into data munging, let's take a moment to explain the difference between an algorithm and a model, two terms we've been using up until now without a formal definition.
Consider the simple linear regression example we saw in Chapter 1, Introduction to Machine Learning and Predictive Analytics — the linear regression equation with one predictor:

Here, x is the variable, ? the prediction, not the real value, and (a,b) the parameters of the linear regression model:
- The conceptual or theoretical model is the representation of the data that is the most adapted to the actual dataset. It is chosen at the beginning by the data scientist. In this case, the conceptual model is the linear regression model, where the prediction is a linear combination of a variable. Other conceptual models include decision trees, naive bayes, neural networks, and so on. All these models have parameters that need to be tuned to the actual data.
- The algorithm is the computational process that will calculate the optimal parameters of the conceptual model. In our simple linear regression case, the algorithm will calculate the optimal parameters a and b. Here optimal means that it gives the best predictions given the available dataset.
- Finally, the predictive model corresponds to the conceptual model associated with the optimal parameters found for the available dataset.
In reality, no one explicitly distinguishes between the conceptual model and the predictive model. Both are called the model.
In short, the algorithm is the method of learning, and the model is what results form the learning phase. The model is the conceptual model (trees, svm, linear) trained by the algorithm on your training dataset.
- 數據庫基礎與應用:Access 2010
- MongoDB管理與開發精要
- Visual Studio 2015 Cookbook(Second Edition)
- 新型數據庫系統:原理、架構與實踐
- 數據化網站運營深度剖析
- Mastering Machine Learning with R(Second Edition)
- INSTANT Cytoscape Complex Network Analysis How-to
- 城市計算
- 達夢數據庫性能優化
- Learning Proxmox VE
- 跟老男孩學Linux運維:MySQL入門與提高實踐
- 基于OPAC日志的高校圖書館用戶信息需求與檢索行為研究
- 大數據架構商業之路:從業務需求到技術方案
- 云數據中心網絡與SDN:技術架構與實現
- 大數據治理與安全:從理論到開源實踐