官术网_书友最值得收藏!

Preparing to Train a Predictive Model

Here, we will cover the preparation required to train a predictive model. Although not as technically glamorous as training the models themselves, this step should not be taken lightly. It's very important to ensure you have a good plan before proceeding with the details of building and training a reliable model. Furthermore, once you've decided on the right plan, there are technical steps in preparing the data for modeling that should not be overlooked. 

We must be careful not to go so deep into the weeds of technical tasks that we lose sight of the goal. Technical tasks include things that require programming skills, for example, constructing visualizations, querying databases, and validating predictive models. It's easy to spend hours trying to implement a specific feature or get the plots looking just right. Doing this sort of thing is certainly beneficial to our programming skills, but we should not forget to ask ourselves if it's really worth our time with respect to the current project.

Also, keep in mind that Jupyter Notebooks are particularly well-suited for this step, as we can use them to document our plan, for example, by writing rough notes about the data or a list of models we are interested in training. Before starting to train models, it's good practice to even take this a step further and write out a well-structured plan to follow. Not only will this help you stay on track as you build and test the models, but it will allow others to understand what you're doing when they see your work.

After discussing the preparation, we will also cover another step in preparing to train the predictive model, which is cleaning the dataset. This is another thing that Jupyter Notebooks are well-suited for, as they offer an ideal testing ground for performing dataset transformations and keeping track of the exact changes. The data transformations required for cleaning raw data can quickly become intricate and convoluted; therefore, it's important to keep track of your work. As discussed in the first chapter, tools other than Jupyter Notebooks just don't offer very good options for doing this efficiently.

主站蜘蛛池模板: 浮梁县| 华蓥市| 容城县| 江门市| 正宁县| 扶余县| 淮阳县| 房产| 新泰市| 玛多县| 托克托县| 徐闻县| 榆林市| 凌云县| 盱眙县| 闽侯县| 西华县| 罗山县| 淳安县| 哈尔滨市| 新余市| 库伦旗| 望奎县| 南陵县| 沈阳市| 顺昌县| 塘沽区| 长治县| 南宁市| 开化县| 孟连| 衡东县| 嘉兴市| 甘德县| 定结县| 万年县| 清河县| 江华| 大化| 合肥市| 和硕县|