- Statistics for Machine Learning
- Pratap Dangeti
- 300字
- 2021-07-02 19:05:54
Steps in machine learning model development and deployment
The development and deployment of machine learning models involves a series of steps that are almost similar to the statistical modeling process, in order to develop, validate, and implement machine learning models. The steps are as follows:
- Collection of data: Data for machine learning is collected directly from structured source data, web scrapping, API, chat interaction, and so on, as machine learning can work on both structured and unstructured data (voice, image, and text).
- Data preparation and missing/outlier treatment: Data is to be formatted as per the chosen machine learning algorithm; also, missing value treatment needs to be performed by replacing missing and outlier values with the mean/median, and so on.
- Data analysis and feature engineering: Data needs to be analyzed in order to find any hidden patterns and relations between variables, and so on. Correct feature engineering with appropriate business knowledge will solve 70 percent of the problems. Also, in practice, 70 percent of the data scientist's time is spent on feature engineering tasks.
- Train algorithm on training and validation data: Post feature engineering, data will be divided into three chunks (train, validation, and test data) rather than two (train and test) in statistical modeling. Machine learning are applied on training data and the hyperparameters of the model are tuned based on validation data to avoid overfitting.
- Test the algorithm on test data: Once the model has shown a good enough performance on train and validation data, its performance will be checked against unseen test data. If the performance is still good enough, we can proceed to the next and final step.
- Deploy the algorithm: Trained machine learning algorithms will be deployed on live streaming data to classify the outcomes. One example could be recommender systems implemented by e-commerce websites.
推薦閱讀
- 少兒人工智能趣味入門:Scratch 3.0動畫與游戲編程
- 基于粒計算模型的圖像處理
- 自制編譯器
- PaaS程序設(shè)計
- C語言程序設(shè)計
- Python機(jī)器學(xué)習(xí)經(jīng)典實(shí)例
- 數(shù)據(jù)結(jié)構(gòu)案例教程(C/C++版)
- Linux:Embedded Development
- 領(lǐng)域驅(qū)動設(shè)計:軟件核心復(fù)雜性應(yīng)對之道(修訂版)
- Arduino Wearable Projects
- HTML5+CSS3+JavaScript 從入門到項(xiàng)目實(shí)踐(超值版)
- 深度實(shí)踐KVM:核心技術(shù)、管理運(yùn)維、性能優(yōu)化與項(xiàng)目實(shí)施
- Tableau Dashboard Cookbook
- PHP+MySQL Web應(yīng)用開發(fā)教程
- Mastering Magento Theme Design