官术网_书友最值得收藏!

Model architecture for different machine learning problems

The kind of problem we are solving will decide mostly what layers we will use, starting from a linear layer to Long Short-Term Memory (LSTM) for sequential data. Based on the type of the problem you are trying to solve, your last layer is determined. There are three problems that we generally solve using any machine learning or deep learning algorithms. Let's look at what the last layer would look like:

  • For a regression problem, such as predicting the price of a t-shirt to sell, we would use the last layer as a linear layer with an output of one, which outputs a continuous value.
  • For classifying a given image as t-shirt or shirt, you would use a sigmoid activation function, as it outputs values either closer to one or zero, which is generally called a binary classification problem.
  • For a multi-class classification, where we have to classify whether a given image is a t-shirt, jeans, shirt, or dress, we would use a softmax layer at the end our network. Let's try to understand intuitively what softmax does without going into the math of it. It takes inputs from the previous linear layer, for example, and outputs the probabilities for a given number of examples. In our example, it would be trained to predict four probabilities for each type of image. Remember, all these probabilities always add up to one.
主站蜘蛛池模板: 海兴县| 且末县| 普定县| 大兴区| 莱西市| 徐水县| 普安县| 惠东县| 镶黄旗| 益阳市| 邢台市| 屯昌县| 嵩明县| 金溪县| 怀化市| 察雅县| 故城县| 如皋市| 常宁市| 弋阳县| 汉寿县| 丹江口市| 涪陵区| 镇雄县| 同心县| 特克斯县| 大关县| 康定县| 治县。| 集贤县| 黑龙江省| 新乐市| 长子县| 澄城县| 射洪县| 元谋县| 济源市| 泰顺县| 张家界市| 宁海县| 甘泉县|