- Deep Learning with R for Beginners
- Mark Hodnett Joshua F. Wiley Yuxi (Hayden) Liu Pablo Maldonado
- 294字
- 2021-06-24 14:30:40
Summary
This chapter showed how to get started building and training neural networks to classify data, including image recognition and physical activity data. We looked at packages that can visualize a neural network and we created a number of models to perform classification on data with 10 different categories. Although we only used some neural network packages rather than deep learning packages, our models took a long time to train and we had issues with overfitting.
Some of the basic neural network models in this chapter took a long time to train, even though we did not use all the data available. For the MNIST data, we used approx. 8,000 rows for our binary classification task and only 6,000 rows for our multi-classification task. Even so, one model took almost an hour to train. Our deep learning models will be much more complicated and should be able to process millions of records. You can now see why specialist hardware is required for training deep learning models.
Secondly, we see that a potential pitfall in machine learning is that more complex models will be more likely to overfit the training data, so that evaluating performance in the same data used to train the model results in biased, overly optimistic estimates of the model performance. Indeed, this can even make a difference as to which model is chosen as the best. Overfitting is also an issue for deep neural networks. In the next chapter, we will discuss various techniques used to prevent overfitting and obtain more accurate estimates of model performance.
In the next chapter we will look at building a neural network from scratch and see how it applies to deep learning. We will also discuss some methods to deal with overfitting.
- 數據分析實戰:基于EXCEL和SPSS系列工具的實踐
- Creating Mobile Apps with Sencha Touch 2
- 從0到1:數據分析師養成寶典
- 企業大數據系統構建實戰:技術、架構、實施與應用
- WS-BPEL 2.0 Beginner's Guide
- MySQL 8.x從入門到精通(視頻教學版)
- PostgreSQL指南:內幕探索
- 大數據治理與安全:從理論到開源實踐
- 數據應用工程:方法論與實踐
- PostgreSQL高可用實戰
- 大數據測試技術:數據采集、分析與測試實踐(在線實驗+在線自測)
- Redis 6開發與實戰
- SQL Server 2012數據庫技術及應用(第4版)
- Unity 4.x Game AI Programming
- Learning Game AI Programming with Lua