官术网_书友最值得收藏!

Model selection

This step comes after selecting a proper subset of your input variables by using any dimensionality reduction technique. Choosing the proper subset of the input variable will make the rest of the learning process very simple.

In this step, you are trying to figure out the right model to learn.

If you have any prior experience with data science and applying learning methods to different domains and different kinds of data, then you will find this step easy as it requires prior knowledge of how your data looks and what assumptions could fit the nature of your data, and based on this you choose the proper learning method. If you don't have any prior knowledge, that's also fine because you can do this step by guessing and trying different learning methods with different parameter settings and choose the one that gives you better performance over the test set.

Also, initial data analysis and visualization will help you to make a good guess about the form of the distribution and nature of your data.

主站蜘蛛池模板: 内黄县| 西乡县| 黑山县| 东方市| 澄江县| 山阳县| 铜陵市| 玛纳斯县| 儋州市| 城口县| 华坪县| 瑞安市| 阿坝县| 卫辉市| 湖州市| 平远县| 图木舒克市| 应城市| 鹤山市| 马龙县| 仲巴县| 西峡县| 浮梁县| 屏边| 巴马| 巴塘县| 板桥市| 洪江市| 汪清县| 阳西县| 博客| 峡江县| 黑河市| 神农架林区| 聊城市| 临澧县| 泉州市| 齐齐哈尔市| 当涂县| 大田县| 黄浦区|