
The softmax function

The softmax function is mainly used in classification problems, typically in the output layer, where it outputs a probability for each class. As seen earlier with binary logistic regression, the sigmoid function can handle only two classes. To handle multiple classes, we need a function that generates a value for every class such that the values follow the rules of probability. The softmax function fulfills this objective: it exponentiates the output for each class and divides it by the sum of the exponentiated outputs for all the classes, so that every value lies between 0 and 1 and the values sum to 1:

$$\mathrm{softmax}(x_i) = \frac{e^{x_i}}{\sum_{j} e^{x_j}}$$

For example, let x be a vector of four values, one for each of four classes.

Applying the softmax function to x then gives four results, one per class, rounded to three decimal places.

Thus, we obtain a probability for every class. Since a classifier is expected to output a probability for each class, the softmax function is a natural candidate for the activation function of the classifier's output layer.
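
The computation described above can be sketched in a few lines of Python. This is only an illustrative sketch: the input values `[1.0, 2.0, 3.0, 4.0]` are an assumption chosen for demonstration and are not taken from the original example, and subtracting the maximum score before exponentiating is a common numerical-stability step that the text does not mention.

```python
import math

def softmax(scores):
    """Return softmax probabilities for a list of raw class scores."""
    # Subtracting the max does not change the result but avoids overflow in exp.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scores for four classes (illustrative values only).
x = [1.0, 2.0, 3.0, 4.0]
probs = softmax(x)
rounded = [round(p, 3) for p in probs]
print(rounded)     # [0.032, 0.087, 0.237, 0.644]
print(sum(probs))  # approximately 1.0
```

Each output lies between 0 and 1, the outputs sum to 1, and the largest input score receives the largest probability.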
