官术网_书友最值得收藏!

Preparing to Train a Predictive Model

Here, we will cover the preparation required to train a predictive model. Although not as technically glamorous as training the models themselves, this step should not be taken lightly. It's very important to ensure you have a good plan before proceeding with the details of building and training a reliable model. Furthermore, once you've decided on the right plan, there are technical steps in preparing the data for modeling that should not be overlooked. 

We must be careful not to go so deep into the weeds of technical tasks that we lose sight of the goal. Technical tasks include things that require programming skills, for example, constructing visualizations, querying databases, and validating predictive models. It's easy to spend hours trying to implement a specific feature or get the plots looking just right. Doing this sort of thing is certainly beneficial to our programming skills, but we should not forget to ask ourselves if it's really worth our time with respect to the current project.

Also, keep in mind that Jupyter Notebooks are particularly well-suited for this step, as we can use them to document our plan, for example, by writing rough notes about the data or a list of models we are interested in training. Before starting to train models, it's good practice to even take this a step further and write out a well-structured plan to follow. Not only will this help you stay on track as you build and test the models, but it will allow others to understand what you're doing when they see your work.

After discussing the preparation, we will also cover another step in preparing to train the predictive model, which is cleaning the dataset. This is another thing that Jupyter Notebooks are well-suited for, as they offer an ideal testing ground for performing dataset transformations and keeping track of the exact changes. The data transformations required for cleaning raw data can quickly become intricate and convoluted; therefore, it's important to keep track of your work. As discussed in the first chapter, tools other than Jupyter Notebooks just don't offer very good options for doing this efficiently.

主站蜘蛛池模板: 安远县| 大庆市| 西畴县| 长武县| 友谊县| 海林市| 灵台县| 琼结县| 张家界市| 比如县| 江津市| 钦州市| 瑞安市| 文成县| 广南县| 宿松县| 清水县| 镇平县| 北票市| 旬阳县| 随州市| 高安市| 望谟县| 淮阳县| 偏关县| 广汉市| 五大连池市| 灯塔市| 同仁县| 怀仁县| 农安县| 长葛市| 临桂县| 科技| 宜兴市| 西平县| 新密市| 元阳县| 内乡县| 营口市| 商洛市|