- Machine Learning with Swift
- Alexander Sosnovshchenko
- 144字
- 2021-06-24 18:54:56
Splitting the data
Finally, we want to split our data into training and test sets. We will train our classifier only on the training set, so it will never see the test set until we want to evaluate its performance. This is a very important step, because as we will see in the future, the quality of predictions on the test set can differ dramatically from the quality measured on the training set. Data splitting is an operation specific to machine learning tasks, so we will import scikit-learn (a machine learning package) and use some functions from it:
In []: from sklearn.model_selection import train_test_split X_train, X_test, y_train, y_test = train_test_split(features, labels, test_size=0.3, random_state=42) X_train.shape, y_train.shape, X_test.shape, y_test.shape Out[]: ((700, 6), (700,), (300, 6), (300,))
Now we have 700 training samples with 6 features each, and 300 test samples with the same number of features.
推薦閱讀
- 新媒體跨界交互設計
- Applied Unsupervised Learning with R
- 電腦組裝與維修從入門到精通(第2版)
- The Applied AI and Natural Language Processing Workshop
- 數字邏輯(第3版)
- VCD、DVD原理與維修
- Mastering Adobe Photoshop Elements
- Arduino BLINK Blueprints
- OpenGL Game Development By Example
- 單片機開發與典型工程項目實例詳解
- 電腦橫機使用與維修
- 可編程邏輯器件項目開發設計
- USB應用開發寶典
- 筆記本電腦的結構、原理與維修
- MicroPython Cookbook