官术网_书友最值得收藏!

Modeling

Once the data preparation is complete, the next phase is modeling. Here, you will be selecting an appropriate algorithm and using the data to train your model. There are a number of best practices to adhere to during this stage, and we will discuss them in detail, but the basic steps involve splitting your data into training, testing, and validation sets. This splitting up of the data may seem illogical—especially when more data typically yields better models—but as we'll see, doing this allows us to get better feedback on how the model will perform in the real world, and prevents us from the cardinal sin of modeling: overfitting. We will talk more about this in later chapters.

主站蜘蛛池模板: 秦安县| 龙门县| 滨海县| 牙克石市| 务川| 竹溪县| 凌云县| 平阴县| 盐城市| 宁陕县| 台北县| 芜湖市| 吴旗县| 大宁县| 西青区| 龙游县| 沽源县| 资阳市| 乌苏市| 文水县| 南雄市| 和顺县| 行唐县| 娱乐| 台安县| 娄烦县| 三明市| 金坛市| 芜湖市| 乌兰浩特市| 合阳县| 黑水县| 西乌| 富源县| 内江市| 灵山县| 祥云县| 庆元县| 辛集市| 会泽县| 桐庐县|