- Machine Learning in Java
- AshishSingh Bhatia Bostjan Kaluza
- 176字
- 2021-06-10 19:30:05
Train and test sets
To estimate the generalization error, we split our data into two parts: training data and testing data. A general rule of thumb is to split them by the training: testing ratio, that is, 70:30. We first train the predictor on the training data, then predict the values for the test data, and finally, compute the error, that is, the difference between the predicted and the true values. This gives us an estimate of the true generalization error.
The estimation is based on the two following assumptions: first, we assume that the test set is an unbiased sample from our dataset; and second, we assume that the actual new data will reassemble the distribution as our training and testing examples. The first assumption can be mitigated by cross-validation and stratification. Also, if it is scarce, one can't afford to leave out a considerable amount of data for a separate test set, as learning algorithms do not perform well if they don't receive enough data. In such cases, cross-validation is used instead.
- Project 2007項目管理實用詳解
- 大學計算機基礎:基礎理論篇
- 計算機控制技術
- 機艙監測與主機遙控
- Photoshop CS3圖層、通道、蒙版深度剖析寶典
- Moodle Course Design Best Practices
- 具比例時滯遞歸神經網絡的穩定性及其仿真與應用
- Enterprise PowerShell Scripting Bootcamp
- LAMP網站開發黃金組合Linux+Apache+MySQL+PHP
- MCGS嵌入版組態軟件應用教程
- 計算機組網技術
- Unity Multiplayer Games
- Machine Learning Algorithms(Second Edition)
- TensorFlow Deep Learning Projects
- 漢字錄入技能訓練