官术网_书友最值得收藏!

Model selection

This step comes after selecting a proper subset of your input variables by using any dimensionality reduction technique. Choosing the proper subset of the input variable will make the rest of the learning process very simple.

In this step, you are trying to figure out the right model to learn.

If you have any prior experience with data science and applying learning methods to different domains and different kinds of data, then you will find this step easy as it requires prior knowledge of how your data looks and what assumptions could fit the nature of your data, and based on this you choose the proper learning method. If you don't have any prior knowledge, that's also fine because you can do this step by guessing and trying different learning methods with different parameter settings and choose the one that gives you better performance over the test set.

Also, initial data analysis and visualization will help you to make a good guess about the form of the distribution and nature of your data.

主站蜘蛛池模板: 丰原市| 双牌县| 西盟| 朝阳县| 礼泉县| 怀宁县| 锦屏县| 定日县| 金昌市| 巩留县| 伽师县| 普洱| 德庆县| 中方县| 梁平县| 新蔡县| 临湘市| 遂宁市| 中方县| 上饶县| 灵璧县| 西乌珠穆沁旗| 湖州市| 永济市| 称多县| 乐昌市| 东至县| 金溪县| 河北省| 罗江县| 安远县| 和顺县| 天峻县| 莆田市| 林周县| 北安市| 龙门县| 青龙| 六盘水市| 齐齐哈尔市| 正安县|