
Experiment with hidden layers and hidden units

The most commonly used layers in general neural networks are fully-connected layers. In a fully-connected layer, every unit is connected to every unit in the next layer, while units within the same layer share no connections. As stated before, the weights of these connections are the trainable parameters of the network; they are learned during training. The more connections a network has, the more parameters it has and the more complex the patterns it can model. Many state-of-the-art models have more than 100 million parameters. However, a deep neural network with many layers and units takes more time to train. Extremely deep models also take significantly longer to produce predictions at inference time, which can be problematic in a real-time environment. In the following chapters, we will introduce other popular layer types that are specific to their network types.
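To make the link between connections and parameters concrete, the following minimal sketch counts the trainable parameters of a stack of fully-connected layers. It assumes one weight per connection plus one bias per unit, which is the usual setup for a dense layer; the function name count_dense_parameters and the layer sizes are illustrative and not taken from the text.

```python
def count_dense_parameters(layer_sizes):
    """Count trainable parameters in a stack of fully-connected layers.

    layer_sizes lists the number of units per layer, starting with the
    input layer and ending with the output layer.
    Assumption: each layer has one weight per incoming connection and
    one bias per unit.
    """
    total = 0
    # Pair every layer with the layer that follows it.
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        weights = n_in * n_out  # all pairwise connections
        biases = n_out          # one bias per unit in the receiving layer
        total += weights + biases
    return total


# Illustrative network: 784 inputs, two hidden layers, 10 outputs.
print(count_dense_parameters([784, 128, 64, 10]))  # 109386
```

Doubling the width of both hidden layers in this example roughly doubles the weights between the input and the first hidden layer and quadruples the weights between the two hidden layers, which is why wide, deep networks grow expensive so quickly.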

Picking the right number of hidden layers and hidden units is important. With too few units, the model cannot pick up all the signals in the data, which results in low accuracy and poor predictive performance (underfitting). With too many units, the model tends to overfit the training data (see regularization for techniques to prevent overfitting) and does not generalize well. Therefore, we always have to look at the performance on the validation data to find the right balance. In the next recipe, we will show an example of overfitting and output the number of trainable parameters.
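The selection step described above can be sketched as a small search over candidate sizes, keeping the one with the lowest validation loss. This is only an outline under stated assumptions: pick_hidden_units and toy_validation_loss are hypothetical names, and the toy function is a synthetic U-shaped curve standing in for "train a model with this many units and evaluate it on the validation set". It is not a measured result.

```python
import math


def pick_hidden_units(candidates, validation_loss_fn):
    """Return the candidate with the lowest validation loss, plus all scores.

    validation_loss_fn is supplied by the caller. In practice it would
    train a model with the given number of hidden units and return its
    loss on held-out validation data.
    """
    scores = {units: validation_loss_fn(units) for units in candidates}
    best = min(scores, key=scores.get)
    return best, scores


def toy_validation_loss(units):
    # Synthetic stand-in, NOT real training: a U-shaped curve that is
    # high for very small networks (underfitting) and for very large
    # ones (overfitting), with its minimum at 32 units.
    return (math.log2(units) - 5) ** 2


best, scores = pick_hidden_units([4, 16, 32, 128, 512], toy_validation_loss)
print(best)
```

Passing the evaluation in as a function keeps the search logic separate from the training code, so the same loop can be reused once a real training and validation routine is available.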

Deep FNNs perform well if a lot of high-dimensional training data is available. For simple classification or regression tasks, a network with a single hidden layer often performs best.